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Introduction 


Neurofibromatosis  type  1  affects  1/4000  individuals  worldwide  and  predisposes  to  the  growth  of 
both  benign  and  malignant  tumors.  We  hypothesized  that  the  early  age  at  onset  of  cutaneous 
neurofibromas  observed  in  patients  with  NF1  microdeletions  is  caused  by  the  co-deletion  of  NF1 
and  a  second  gene  NPL  (neurofibroma-potentiating  locus)(l-3).  This  hypothesis  was  supported 
by  our  findings  that  the  majority  of  NF1  microdeletion  breakpoints  are  clustered  at  large 
repetitive  elements  (NFIREPs)  that  flank  the  NF1  locus  (4).  In  this  application,  we  proposed  to 
test  the  following  hypotheses:  [1]  NF1  microdeletion  breakpoints  occur  at  a  small  segment  that 
defines  a  meiotic  recombination  hotspot(s)  within  the  15-100  kb  NF1REP  elements  and  that 
homologous  recombination  at  the  hotspot  is  facilitated  by  a  nearby  recombinogenic  element.  [2] 
Polymorphism  in  NF1REP  number,  orientation,  and/or  complexity  predisposes  certain 
individuals  to  NF1  microdeletion  and  the  consequent  high  neurofibroma  burden.  [3]  NF1 
microdeletion  increases  the  risk  of  developing  a  solid  tumor  malignancy.  [4]  NFIREP-mediated 
NF1  microdeletion  in  somatic  cells  is  an  underlying  mechanism  of  loss  of  heterozygosity  at  the 
NF1  locus  in  malignant  tumors  of  NF1  patients.  This  research  will  identify  specific  genetic  loci 
and  mechanisms  that  play  a  role  in  tumor  development  in  NF1  patients. 

Body 

Progress  described  for  items/timeline  described  in  the  original  “Statement  of  Work”  in  the  grant 
proposal. 

Year  1: 

•  Development  of  a  probe  to  detect  NFIREP-mediated  NF1  deletion  junction  fragments 

We  have  mapped  and  sequenced  the  breakpoints  of  deletion  patients  and  identified  two 
recombination  hotspots  for  recurrent  NF1  microdeletion,  along  with  mapping  and  sequencing 
three  novel  breakpoints.  We  have  developed  rapid  and  accurate  polymerase  chain  reaction  (PCR) 
assays  that  can  detect  the  recurrent  NF1  microdeletions  in  a  blood  sample  from  a  patient.  These 
results  far  exceed  our  original  statement  of  work,  when  we  envisioned  development  of  a  probe  for 
use  on  Southern  blots  to  detect  NF1  microdeletion  junction  fragment.  Together,  these  data  have 
confirmed  and  proven  our  first  hypothesis  that  NF1  microdeletion  breakpoints  occur  at  a  small 
segment  that  defines  a  meiotic  recombination  hotspotfs!  within  the  1 5-100  kb  NF1REP  elements 
and  that  homologous  recombination  at  the  hotspot  is  facilitated  bv  a  nearby  recombinogenic 
element.  These  results  are  published  in  the  attached  manuscripts  (5-7). 

Stephens  K.  Genetics  of  neurofibromatosis  1-associated  peripheral  nerve  sheath  tumors. 

Cancer  Invest,  in  press. 

Dorschner  MO,  Brems  H,  Le  R,  et  al.  Two  paralogous  recombination  hotspots  mediate 

69%  ofNFl  microdeletions,  submitted. 
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In  summary,  we  have  shown  that  the  majority  (14/17)  NF1  microdeletions  were  1.5  Mb 
in  length  and  arose  by  a  mechanism  of  recombination  between  misaligned  directly-oriented  repeat 
elements  that  flank  the  NF1  gene  [termed  NF1REP-P1  (proximal)  and  -M  (medial)]  of  about  15- 
100  kb  in  length  (4)(Figure  1  below).  These  repeats  are  paralogs,  which  are  sequences  of  high 
identity  that  arise  by  duplication  within  a  species.  This  year  we  introduced  the  term  paralogous 
recombination  to  describe  the  process  of  homologous  recombination  between  paralogs  (6).  When 
we  mapped  NF1  deletion  breakpoints  to  the  sequence  level,  we  unexpectedly  discovered  that 
46%  of  patients  with  entire  NF1  gene  deletions  (N=54)  have  breakpoints  that  map  to  a  2  kb 
recombination  hotspot  within  the  NFIREPs  (5).  A  PCR  assay  to  detect  deletion  junction 
fragments  was  developed  by  locating  the  upstream  primer  in  NF1REP-P1  and  the  downstream 
primer  in  NF1REP-M  taking  advantage  of  paralogous  sequence  variants  (PSV,  which  are 
NFIREP-specific  nucleotide  differences)  between  the  two  paralogs.  A  3.4  kb  amplicon  is 
produced  from  deleted  chromosomes  with  breakpoints  at  this  hotspot;  however,  amplification  of 
the  1.5  Mb  segment  from  normal  chromosomes  cannot  occur. 

By  mapping  and  sequencing  NF1  microdeletion  breakpoints  of  additional  patients,  we 
have  extended  the  location  of  this  original  NF1  microdeletion  hotspot  and  have  identified  a 
second  hotspot,  developed  assays  to  detect  deletions  at  both  hotspots,  and  determined  the 
frequency  of  NF1  microdeletion  that  occurs  at  these  hotspots.  These  data  are  described  in  detail 
in  the  attached  manuscript  Dorschner  et  al,  which  is  submitted  for  publication  (7)(see  attached). 
In  summary,  the  first  hotspot  was  extended  to  4.1  kb  in  length  and  designated  as  paralogous 
recombination  site  1  (PRS1).  The  second  cluster  was  6.3  kb  in  length  and  extended  the  cluster 
harboring  the  previously  defined  2  kb  recombination  hotspot.  This  cluster,  designated  as 
paralogous  recombination  site  2  (PRS2),  was  about  15  kb  telomeric  to  PRS1. 

Figure  1.  Schematic  of  the  paralogous  recombination  sites  within  the  NF1REP  paralogs.  Paralogous 
recombination  between  the  NF1REP  elements  results  in  a  recurrent  1.5  Mb  deletion  of  the  entire  NF1 
locus.  The  location  of  the  paralogous  recombination  sites,  PRS1  and  PRS2,  are  shown.  The  stippled 
portion  of  PRS2  designates  the  previously  reported  2  kb  recombination  hotspot,  while  the  hatchmarks 
designate  the  extended  portion  of  the  site  reported  here.  The  basepair  coordinates  are  relative  to  BAC 
271K11,  which  contains  NF1REP-P1. 
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We  developed  new,  rapid,  efficient  PCR  assays  that  detect  NF1  microdeletions  at  either  PRS1  or 
PRS2.  We  applied  these  assays  to  screening  a  larger  cohort  of  MF7 deletion  patients  (N=78)  and 
determined  that  18%  of  deletions  occurred  at  PRS1,  while  (51%)  occurred  at  PRS2.  As  expected, 
neither  the  7  kb  PRS1  nor  the  7  kb  PRS2  deletion-junction  assays  generated  products  upon 
amplification  of  150  normal  chromosomes.  These  assays  can  be  used  in  clinical  laboratories  for 
the  diagnosis  of  NF1  patients  that  carry  the  recurrent  1.5  Mb  NF1  microdeletion  and  in  research 
laboratories  to  identify  patients  for  phenotype/genotype  studies. 

In  addition  to  these  studies  of  germline  NF1  microdeletions,  we  have  determined  breakpoints  in 
patients  who  have  somatic  mosaicism  for  an  NF1  microdeletion  no  not  occur  at  the  PRS1  or 
PRS2  hotspots.  This  implies  that  these  microdeletions  may  occur  by  a  different  mechanism(s). 
This  is  important  to  determine  since  the  somatic  NF1  loss  that  occurs  during  tumorigenesis  may 
occur  by  a  similar  mechanism.  We  have  mapped  two  mitotic  NF1  microdeletion  breakpoint  to 
the  sequence  level  (Figure  1;  UWA186-1  and  UWA208-1),  developed  a  deletion  junction 
fragment  specific  assay,  and  are  using  that  to  determine  if  other  patients  mosaic  for  NF1 
microdeletions  have  breakpoints  clustered  at  this  location.  Manuscripts  describing  these  patients 
and  their  breakpoints  are  currently  in  preparation,  (Stephens  et  al.  and  Forbes  et  al.). 

NFIREPs  are  comprised  of  a  complex  modular  arrangement  of  paralogs  of  different 
sequence  families. 

Mapping  NF1  breakpoints  necessitated  a  detailed  understanding  of  the  structure  and 
sequence  of  the  NF1REP  paralogs  in  order  to  design  appropriate  primers  and  avoid  co¬ 
amplification  of  other  paralogs.  We  have  determined  the  structure  of  four  NF1REP  paralogs  by 
STS  mapping,  sequencing,  BLAST  and  BLAT  analyses,  and  sequence  alignments.  These  data  do 
not  identify  a  sequence-specific  motif  responsible  for  the  recombination  hotspots,  but  rather 
indicate  that  multiple  factors  contribute  to  the  location  of  the  recombination  events.  These  data 
are  detailed  in  the  attached  manuscript  Forbes  et  al.  which  has  been  submitted  for  publication  (8Y 

In  summary,  the  relative  locations  of  NF1REP-P1,  -P2,  and  -M  are  shown  in  Figure  1,  while 
the  paralog  designated  as  El  9  is  located  on  chromosome  19p.  We  are  highly  confident  of  the 
structure  of  NF1REP-P1,  NF1REP-P2,  and  El 9,  which  are  each  carried  primarily  by  a  single 
BAC  that  has  been  completely  sequenced  by  the  public  Human  Genome  Project.  Selected  PCRs 
and  sequencing  in  our  laboratory  of  BACs  640N20  and  95 IF  1 1  (Human  Genome  Project 
sequence  is  incomplete)  are  consistent  with  the  structure  of  NF1REP-M.  Accuracy  of  the 
sequence  and  structure  of  the  NFIREPs  is  further  supported  by  numerous  long  range  PCRs, 
which  were  consistent  with  the  structures  depicted  in  Figure  2  below. 
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Figure  2.  Top  panel :  The  three  paralogs  in  the  NF1  region  are  shown  oriented  from  centromere  to  telomere. 
BAC  identities  and  accession  numbers  are  shown.  Green  color  indicates  sequences  of  the  KIAA0563rel 
functional  gene  and  related  pseudogene  fragments  OP),  with  numbered  black  vertical  bars  designating  exons  or 
related,  exon-derived  sequences.  Light  green  boxes  within  KIAA0563rel  copies  indicate  the  orientation  of  a  5.8 
kb  inverted  repeat  with  two  copies  on  NF1REP-P1,  and  a  third  copy  in  NF1REP-M.  Landmark  STSs  in  the 
KIAA0563rel  gene  and  pseudogenes  cited  previously  (4)  are  indicated  above  NF1REP-P1.  Yellow  boxes  depict 
SMURF2-derived  pseudogene  fragments,  and  red  boxes  denote  the  PRS.  Lower  panel.  Comparison  of 
NF1REP-P1  to  NF1REP-E19  located  at  chromosome  19pl3.13.  Colored  boxes  match  those  described  for 
panel  B.  PRS1  and  PRS2  sites  in  NF1REP-E19  are  not  known  to  serve  as  recombination  substrates. 
NF1REP-E19  contains  the  LEC2  functional  gene  and  NF1REP-P1  contains  pseudogene  fragments  of  LEC2 
(purple). 


As  detailed  in  Forbes  et  al.  (8),  we  found  that  theThe  NFIREPs  are  complex  with  a  modular 
paralog  structure  that  is  essentially  comprised  of  mix-and-match  components.  The  principal 
sequences  responsible  for  the  two  51-kb  blocks  with  97.5%  sequence  identity  (NF1REP-P1-51 
and  NF1REP-M-51).  Here  we  show  that  these  paralogs  belong  to  a  complex  group  of  paralogs 
from  different  sequence  families  with  three  components  on  17q  and  another  at  1 9p  1 3 .  Multiple 
breakpoint  sequencing  reveals  two  paralogous  recombination  hotspots  within  the  51-kb  blocks. 
We  seek  to  explain  the  location  of  the  hotspots  based  on  sequence  motifs,  quality  of  alignment 
match,  and  nucleotide  sequence  identity.  Hotspot  locations  with  respect  to  relatively  large 
paralogous  alignment  gaps  suggest  that  the  gaps  may  a  primary  factor  localizing  pairing  and 
exchange.  Gene  conversion  tests  reveal  a  700  bp  gene  conversion  tract  at  the  predominant 
paralogous  recombination  hotspot.  Thus,  high  sequence  identity  at  a  hotspot  may  be  a  result,  as 
well  as  a  potential  cause,  of  propensity  to  pair  at  a  particular  site.  Whether  alignment  gaps  are  a 
key  breakpoint  localization  factor  may  be  eventually  assessed  by  discovery  and  comparison  of 
more  systems  that— like  the  NFIREPs— show  relatively  high  levels  of  alignment  mismatch 
between  paralogs. 

Constructing  a  fine,  sequence-based  map  of  each  NF1REP  as  described  in  Forbes  et  al.,  was 
necessary  to  facilitate  experiments  to  test  our  second  hypothesis  that  polymorphism  in  NF1REP 
number,  orientation,  and/or  complexity  predisposes  certain  individuals  to  NF1  microdeletion  and 
the  consequent  high  neurofibroma  burden.  We  are  currently  examining  the  NF1REP-P1  and 
NF1REP-M  structure  and  sequence  of  the  parental  chromosomes  that  underwent  recombination 
to  create  the  de  novo  NF1  microdeletion  in  the  affected  child.  We  have  constructed  human-rodent 
somatic  cell  hybrids  to  separate  the  two  chromosomes  17  of  each  of  the  parents.  Analysis  of  the 
such  hybrid  lines  has  identified  the  two  chromosomes  that  were  substrates  for  the  recombination. 
Analysis  of  the  NF1REP-P1  and  NF1REP-M  of  these  chromosomes  is  currently  underway. 
Originally,  we  simply  amplified  NF1REP  regions  from  the  genomic  DNA  of  each  parent,  cloned 
the  amplimers  and  sequenced  the  region.  However,  we  found  unusual  and  inconsistent  sequences 
that  could  not  be  easily  explained.  To  determine  if  they  were  from  polymorphism  or  unusual 
NF1REP  structure,  we  embarked  on  the  somewhat  arduous  task  of  constructing  the  hybrid  lines. 
Direct  sequencing  of  NF1REP  from  these  lines  will  avoid  cloning  artifacts  and  is  expected  to 
determine  the  precise  structure  of  the  parental  NF1REP  recombination  substrates.  In  addition, 
we  are  beginning  to  screen  hybrid  lines  of  normal  individuals  and  genomic  DNA  of  normal 
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individuals  for  polymorphism  in  the  NF1REP  regions  by  Southern  blot  analysis  and  selected 
PCR  assays. 

Gene  and  transcript  map  of  the  NF1  microdeletion  region. 

In  important  resource  for  understanding  the  genetic  basis  of  the  more  severe  phenotype 
observed  in  NF1  microdeletion  patients  is  a  complete  physical  map  of  the  NF1  deletion  region  at 
chromosome  17ql  12.  In  collaboration  with  Dr.  Dieter  E.  Jenne  (Max-Planck-Institute  of 
Neurobiology),  we  have  combined  our  map  and  sequence  data  and  constructed  a  physical  and 
transcript  map  of  the  >2  Mb  NF1  microdeletion  region.  These  data  are  detailed  in  the  attached 
manuscript  f9T 

Years  2  and  3: 

•  Ascertainment  and  sample  collection  of  NF1  patients  with  tumors 

•  Employ  the  PRS1  and  PRS2  NF1  gene  dosage  assays  to  determine  if  the  frequency  of 
NF1  microdeletions  is  greater  in  patients  that  develop  malignancies 

On  May  12,  2003,  the  USAMRMC  Human  Subjects  committee  finally  approved  our 
protocol.  The  difficulty  of  getting  this  relatively  simple  protocol  approved  by  this  committee 
has  been  a  major,  and  unexpected,  disappointment  of  this  study.  To  facilitate  completion  of  this 
portion  of  the  study,  we  have  a  one  year  no-cost  extension  for  this  grant  award. 

We  have  identified  430  adults  with  malignant  peripheral  nerve  sheath  tumors  (MPNST) 
that  were  referred  to  the  University  of  Washington  Medical  Center  (UWMC).  Careful 
examination  of  the  medical  records  revealed  that  41  had  neurofibromatosis  1 .  We  have  sent 
letters  to  their  UWMC  care  providers  and/or  their  primary  referring  providers  to  determine  if  the 
patients  are  alive  and  to  forward  a  letter  to  them  to  determine  if  they  are  interested  in 
participating  in  our  study.  In  addition,  we  have  ascertained  17  children  affected  with  NF1  who 
had  solid  tumor  malignancies,  primarily  optic  glioma  and  MPNST.  We  are  currently  preparing 
materials  to  be  sent  to  their  care  providers  to  determine  interest  in  our  study. 

In  addition,  we  are  screening  the  tumor  database  at  UWMC  to  identify  NF1  patients  who 
had  solid  malignancies  other  than  MPNST.  If  sufficient  patients  are  identified,  we  will  also 
screen  them  for  NF1  microdeletions. 

Year  3: 

Genotype  grandparents  to  determine  meiotic  mechanism  of  microdeletion 

For  the  following  reasons,  we  have  decided  not  focus  on  this  one  small  aspect  of  our 
study.  We  will  proceed  to  collect  grandparental  samples,  but  this  aim  may  not  be  accomplished 
during  the  grant  period.  The  reasons  are: 

1.  We  have  little  time  remaining  to  obtain  grandparental  samples  because  of  the  extended 
length  of  time  it  took  to  obtain  approval  from  the  USAMRMC  Human  Subjects 
committee, 

2.  With  the  limited  time  we  have  left  to  accomplish  the  work,  we  have  decided  to  focus  our 
efforts  the  phase  of  the  study  regarding  risk  of  malignancy 
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3.  This  aim  is  of  less  importance  because  another  research  group  has  investigated  and 

published  work  related  to  this  question  (10). 

Employ  NF1  microdeletion  assays  to  detect  NFIREP-mediated  deletion  junction 
fragment  in  tumor  tissue  of  NF1  patients 

Hypothesis  number  4  that  NFIREP-mediated  NF1  microdeletion  in  somatic  cells  is  an 
underlying  mechanism  of  loss  of  heterozygosity  at  the  NF1  locus  in  malignant  tumors  of  NF1 
patients  awaits  our  obtaining  such  tissues  for  analysis  now  that  our  Human  Subject  protocol  has 
been  approved  by  the  Army. 

However,  this  hypothesis  has  been  disproven  for  benign  neurofibromas.  Using  the  PRS1 
and  PRS2  deletion  junction-specific  amplification  assays,  we  have  screened  DNA  from  171 
cutaneous  and  plexiform  neurofibromas  and  have  not  detected  NF1  microdeletions  at  these 
recombination  hotspots.  If  somatic  paralogous  recombination  is  a  mechanism  of  loss  of 
heterozygosity  at  NF1  during  neurofibromagenesis,  it  occurs  at  novel  sites. 

Development  of  new  quantitative  PCR  assays  to  detect  novel  NF1  microdeletions 
that  occur  in  germline  or  somatic  tissues. 

Since  approval  of  our  Human  Subjects  protocol  was  delayed  much  longer  than  we 
anticipated,  we  made  valuable  use  of  our  time  by  developing  new,  highly  sensitive,  quantitative 
PCR  assays  to  detect  NF1  microdeletions  of  any  length  in  either  germline  or  somatic  tissues. 

These  assays  required  the  capability  and  sensitivity  to  determine  if  a  locus  or  gene  has  no 
deletion  (2  copies),  one  gene  deletion  (1  copy),  two  gene  deletion  (0  copies),  or  amplification 
(likely  >10  copies).  Two  gene  deletion  and  gene  amplification  are  simple  to  detect  using  various 
methods  of  quantitative  PCR.  The  difficulty  is  in  reliably  differentiating  one  versus  two  copies 
of  a  gene.  Therefore,  we  have  been  focusing  on  developing  the  best  assay  to  detect  a  one  gene 
deletion.  To  determine  which  quantitative  PCR  methods  were  sensitive,  we  choose  to  develop  an 
assay  at  intron  31  of  the  NF1  gene.  To  validate  the  assays  we  used  DNA  from  normal 
individuals  (2  NF1  genes)  and  DNA  from  NF1  patients  with  deletions  involving  the  NF1  gene  (1 
NF1  gene).  Initially,  we  tried  using  SYBR  green  (binds  double  stranded  DNA)  fluorescence  as  a 
method  of  detection  during  real-time  PCR  in  the  LightCycler  instrument  (Roche).  There  was 
considerable  overlap  between  the  crossing  point  values  (Ct)  for  samples  with  one  and  two  NF1 
genes  indicating  that  the  assay  was  not  as  sensitive  or  specific  as  required  (data  not  show). 
Secondly,  we  sought  to  increase  specificity  and  sensitivity  by  using  a  NF1-  specific 
fluorescently-labeled  primer.  We  constructed  a  LUX  primer  (Invitrogen),  which  is  a  hybrid 
primer  comprised  of  NF1  specific  sequences  and  anonymous  sequences  that  are  capable  of  fold- 
back  annealing.  LUX  primers  are  touted  as  having  high  specificity  because  they  only  fluoresce  at 
high  temperatures  when  the  fold  backs  are  melted.  We  had  multiple  problems  with  LUX  primers 
and  decided  to  abandon  that  approach  (data  not  shown).  The  third  method  we  developed  was 
precise,  sensitive,  and  specific  and  involves  SYBR  green  for  detection,  competitive  quantitative 
PCR,  and  melting  curve  analysis  as  detailed  below. 
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We  chose  to  employ  SYBR  green  for  detection  in  combination  with  competitive 
PCR,  which  is  the  most  suitable  method  of  quantification  when  highly  accurate  determinations 
are  required.  We  adapted  and  modified  a  method  published  by  Ruiz-Ponte  et  al.  (1 1).  In  this 
method,  a  known  copy  number  of  a  competitor  is  introduced  directly  in  the  PCR  mixture  along 
with  the  target  DNA  of  the  patient/tumor.  The  competitor,  which  is  almost  identical  to  the 
target  DNA  but  distinguishable  by  product  length,  is  amplified  with  the  same  set  of  primers  so 
that  efficiency  of  amplification  for  the  two  amplicons  is  the  same.  Calibration  curves  of  different 
competitor  concentrations  determines  the  optimal  concentration  that  equals  that  of  the  target 
DNA.  Figure  3  below  shows  the  melting  curves  of  a  normal  control  DNA  samples  (2  copies  of 
NF1),  where  competitor  and  intron  3 1  are  co-amplified  with  equal  efficiency  and  the  area  under 
the  curves  are  equal  (roughly  equivalent  to  peak  height  in  this  example).  We  constructed  the 
competitor  such  that  it  would  be  amplified  with  the  intron  31  primers,  but  have  a  different 
melting  curve  by  replacing  an  internal  TTT  sequence  with  a  CCC  sequence.  As  expected, 
negative  samples  lacking  human  DNA  did  not  amplify. 

Figure  4  below  shows  the  melting  curve  of  a  patient  with  an  NF1  deletion  (1  copy  of 
NF1)  versus  that  of  the  normal  control  individual.  Note  that  the  amplitude  of  the  melting  curve 
of  the  NF1  amplicon  for  the  deletion  patient  is  less  that  of  the  competitor  amplicon  because  there 
are  fewer  targets  in  the  deleted  patient’s  DNA.  For  precision,  we  use  the  peak  areas  of  each 
melting  curves  for  quantitation  rather  than  peak  height.  A  ratio  of  peak  area  of  normal  control  (2 
copies)  over  the  peak  are  of  the  patient  target  DNA  is  calculated,  see  below. 

Figure  3.  Melting  curve  analysis  after  competitive,  quantitative  PCR  at  NF1  intron  31  in  genomic  DNA 
of  a  normal  control  individual.  The  peaks  representing  the  melting  curve  of  the  amplicon  of  the  competitor 
and  the  amplicon  of  the  patient’s  target  DNA  are  indicated.  The  negative  control  without  DNA  shows 
evidence  of  amplificiation. 

Figure  4.  Melting  curve  analysis  after  competitive,  quantitative  PCR  at  NF1  intron  31  can  differentiate  one 
gene  copy  versus  two  gene  copies.  The  results  of  two  reactions  are  shown,  closed  circles  represent  target 
DNA  from  a  normal  control  individual  and  closed  squares  represent  target  DNA  from  an  NF1  patient  with  a 
deletion  of  one  gene.  The  peaks  representing  the  melting  curve  of  the  amplicon  of  the  competitor  and  the 
amplicon  of  the  patient’s  target  DNA  are  indicated.  The  negative  control  without  DNA  shows  evidence  of 
amplification. 

Once  the  concentration  of  competitor  is  determined  for  a  certain  concentration  of  the 
normal  control  DNA,  it  is  essential  that  all  subsequent  reactions  with  unknown  patient  DNA 
samples  contain  exactly  the  same  concentration  of  target  DNA.  Prior  to  the  competitive 
quantitative  PCR  assay,  we  determine  the  exact  concentration  of  each  patient  sample  using  real¬ 
time  quantitative  PCR  at  a  different  locus.  We  amplify  the  TP  A  (tissue  plasminogen  activator) 
gene  on  chromosome  12  in  each  patient  and  compare  that  to  a  standard  curve  using  the  normal 
control  DNA.  From  this  reaction,  we  can  calculate  exactly  what  volume  of  patient  DNA  must  be 
added  to  the  competitive  quantitative  PCR  assay.  An  example  of  the  TPA  real-time  PCR  and 
standard  curve  is  shown  in  Figure  5. 
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Figure  5.  LightCycler  real-time  PCR  at  TPA  locus  showing  standard  curve.  The  upper  panel  shows  the 
results  of  real-time  PCR  of  the  TPA  locus  of  a  dilution  series  of  a  normal  control  individual.  The  reaction 
consists  of  unlabeled  primers  and  uses  an  internal  labeled  (fluorescence  resonance  energy  transfer  (FRET) 
probe  for  detection  of  product.  The  crossing  point  (Ct)  is  defined  as  the  fractional  cycle  at  which 
fluorescence  begins  to  increase  exponentially  and  is  calculated  by  the  LightCycler.  Ct  becomes  larger  as 
the  number  of  TPA  targets  decreases.  The  lower  panel  shows  the  standard  curve  calculated  from  the  above 
data.  Note  the  low  error  and  high  correlation  coefficient. 

The  competitive  quantitative  gene  dosage  assay  we  have  developed  for  intron  31  of  the  NF1  gene  is 
very  precise  and  shows  minimal  variability.  Figures  6  and  7  below  shows  melting  curves  for  6 
replicates  of  an  NF1  deletion  patient  and  a  normal  control  individual,  respectively. 

Figure  6.  Melting  curves  of  competitive  quantitative  PCR  at  NF1  intron  31 

of  6  replicates  of  an  NF1  deletion  patient.  Six  curves  (color  replaced  by  grey  tones  for  this  report)  are 
overlaid  to  demonstrate  the  precision  and  lack  of  variability  in  replicate  samples. 

Figure  7.  Melting  curves  of  competitive  quantitative  PCR  at  NF1  intron  31 

of  6  replicates  of  a  normal  control  individual.  Six  curves  (color  replaced  by  grey  tones  for  this  report)  are 
overlaid  to  demonstrate  the  precision  and  lack  of  variability  in  replicate  samples. 
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These  data  will  be  analyzed  as  shown  in  the  tables  below.  First,  note  the  precision  of  the  assay; 
the  standard  deviations  are  only  2%  and  3%  of  the  means  of  the  assay  for  normal  control 
individuals  and  deletion  patients,  respectively.  As  expected  the  ratio  of  NF1: competitor  peak 
areas  is  essential  1.0  for  samples  with  two  copies  of  the  NF1  gene  and  is  0.5  for  samples  with 
one  copy  of  the  NF1  gene.  We  have  screened  a  large  collection  of  NF1  deletion  patients  and 
normal  individuals  and  found  that  there  is  no  overlap  of  ratio  values  between  these  two 
populations. 

Therefore,  we  have  developed  a  sensitive  and  specific  assay  that  differentiates  reliably 
between  one  versus  two  copies  of  a  gene.  We  are  now  applying  this  same  assay  to  determine  its 
ability  to  quantitate  amplified  loci. 


No. 

Area  of  NF1 
peak 

Area  of 
Competitor 

Peak 

Ratio 

Mean 

Ratio 

S.D. 

i 

1.937 

3.85 

0.503 

2 

2.103 

3.68 

0.571 

3 

1.952 

3.755 

0.519 

4 

2.052 

3.356 

0.611 

5 

1.851 

3.665 

0.505 

6 

1.934 

3.33 

0.580 

0.548 

0.0185 

We  are  currently  developing  similar  quantitative  PCR  assays  for  NF1  exon  5  and  two  loci  that 
flank  the  gene  (AN  and  AH).  These  assays  can  be  used  together  to  identify  and  crudely  map  the 
extent  of  new  NF1  microdeletions. 

New  grant  awarded  based  on  this  research: 

4/7/03-4/6/07  US  Army  Medical  Research  &  Materiel  Command  DAMD  17-03-1-0203 

Clinical  and  Molecular  Consequences  of  NF1  Microdeletion 
Principal  Investigator:  Karen  Stephens,  PhD 


Key  Research  Accomplishments 

•  Identification  of  two  recombination  hotspots  where  69%  of  germline  NF1  microdeletions 
occur.  This  is  important  because  1)  these  hotspots  can  be  analyzed  in  detail  to  investigate 
why  recombination  is  favored  at  these  sites  and  2)  the  majority  of  microdeletion  patients  will 
have  virtually  the  same  genotype  and  will  therefore  constitute  an  important  patient  cohort  for 
genotype/phenotype  analyses. 

•  Development  of  simple  and  reliable  PCR  assays  that  detect  the  presence  of  the  recurrent  1 .5 
Mb  germline  NF1  microdeletion  in  a  patient  blood  sample. 

•  Showed  that  loss  of  heterozygosity  in  benign  neurofibromas  does  not  commonly  occur  at  the 
NF1  microdeletion  hotspots  found  in  germline  NF1  microdeletions 

•  Somatic  mosaic  microdeletions  occur  at  different  sites  than  germline  microdeletions.  This 
implies  that  they  occur  by  a  novel  mechanism,  which  may  also  apply  during  tumorigenesis. 

•  A  detailed  structural  and  sequence  analysis  of  the  NF1  paralogs,  NF 1 REP-P 1 ,  P2,  and  M, 
which  identified  factors  that  may  contribute  to  the  recombination  at  hotspots  PRS1  and 
PRS2. 

•  Developed  a  specific  and  sensitive  competitive,  quantitative  PCR  methodology  that  can 
differentiate  one  copy  from  two  copies  in  a  genome.  Employed  real-time  PCR  of  the  TPA 
locus  as  a  means  of  precise  DNA  quantification.  Validated  the  competitive,  quantitative  PCR 
methodology  by  assay  of  the  NF1  locus  in  normal  control  individuals  and  NF1  deletion 
patients. 
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•  Documented  the  reproducibility  of  the  competitive,  quantitative  PCR  method. 

Reportable  Outcomes 

Manuscripts.  Peer-reviewed,  primary  research: 

Messiaen  L,  Riccardi  V,  Peltonen  J,  Maertens  O,  Callens  T,  Karvonen  SL,  Leisti  E-L, 
Koivunen  J,  Vandenbroucke  I,  Stephens  K,  Poyhonen  M.  Type  of  NF1 
mutations  in  two  large  families  with  spinal  neurofibromatosis  cannot  account  for 
the  clinical  homogeneity  in  severity  of  the  disease.  J  Med  Genet  40: 122-126, 
2003. 

Jenne  DE,  Tinschert  S,  Dorschner  MO,  Hameister  H,  Stephens  K,  Kehrer-Sawatzki  H. 
Complete  physical  map  and  gene  content  of  the  human  NF1  tumor  suppressor 
region  in  human  and  mouse.  Genes  Chromosomes  Cancer  37:1 11-20, 2003 
Fishbein  L,  Sanek  N,  Stephens  K,  Wallace  MR.  An  association  between  the  ERBB2 
codon  655  polymorphism  and  NF1.  Am  J  Med  Genet,  submitted. 

Dorschner  MO,  Brems  H,  Le  R,  et  al.  Two  paralogous  recombination  hotspots  mediate 
69%  of  NF1  microdeletions,  submitted. 

Forbes,  S.H.,  Dorschner,  M.O.,  Le,  R.  and  Stephens,  K.  Genomic  context  of  paralogous 
recombination  substrates  mediating  the  recurrent  NF1  region  microdeletion. 
Genome  Res,  submitted. 

Invited  Reviews: 

Stephens  K.  Genetics  of  neurofibromatosis  1 -associated  peripheral  nerve  sheath  tumors. 
Cancer  Invest,  in  press  (Dec,  2003). 

Book  Chapters: 

Stephens  K.  Neurofibromatosis.  In  Molecular  Pathology  in  Clinical  Practice,  Eds. 

D.G.B.  Leonard,  A.  Bagg,  A.  Caliendo,  K.  Kaul,  K.  Snow-Bailey,  V.  Van  Deerlin, 
Springer- Verlag,  in  press. 

Abstracts 

Stephens,  K.  About  70%  of  NF1  microdeletions  are  recurrent  and  occur  at  discrete 
recombination  sites  within  the  flanking  NF1REP  paralogs,  which  are  complex  modular  assemblies 
of  low-copy  repeats  of  different  sequence  families.  National  Neurofibromatosis  Foundation 
International  Consortium  on  Gene  Cloning  and  Gene  Function  of  NF1  and  NF2.  Aspen,  CO, 
June,  2003. 

Conference  speaker,  invited: 

National  Neurofibromatosis  Foundation  International  Consortium  on  Gene  Cloning  and  Gene 
Function  of  NF1  and  NF2.  “Megabase  Deletions”,  Aspen,  CO,  June,  2003. 

Invited  speaker  to  lav  organizations.  Community  Service 

NF1  Roundtable  Discussion  and  Symposium,  Washington  Chapter  of  the  National 
Neurofibromatosis  Foundation,  Seattle,  WA,  May  10,  2003,  Panelist 
NF1  Research.  Illinois  Neurofibromatosis,  Inc.,  Chicago,  IL,  October  19, 2003. 
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Development  of  clinical  diagnostic  assays  to  detect  NF1  microdeletion. 

Simple  and  reliable  assays  were  developed  to  detect  the  presence  of  an  NF1  microdeletion  in 
a  patient  blood  sample.  Using  two  different  polymerase  chain  reaction  assays,  we  can  detect 
69%  of  NF1  microdeletions.  We  have  transferred  the  technology  and  resources  for  these 
assays  to  the  UWMC  Genetics  Laboratory  and  an  NF1  microdeletion  clinical  test  is  now 
available  to  physicians  worldwide. 

Development  of  research  assays  to  detect  NF1  microdeletion 

Development  of  competitive,  quantitative  PCR  assays  for  differentiating  one  versus  two 
copies  of  NF 1  gene  intron  31. 

Training 

Stephen  H.  Forbes,  PhD,  is  being  trained  in  the  molecular  genetics  of  neurofibromatosis  1. 
Bingbing  Wang,  PhD,  MD,  Postdoctoral  Fellow,  is  being  trained  in  the  molecular  genetics  of 
neurofibromatosis  1. 

Funding  Awarded  based  on  work  supported  bv  this  award 

4/7/03-4/6/07  US  Army  Medical  Research  &  Materiel  Command  DAMD  17-03-1-0203 

Clinical  and  Molecular  Consequences  of  NF1  Microdeletion 
Principal  Investigator:  Karen  Stephens,  PhD 

Conclusions 

Neurofibromatosis  type  1  affects  1/4000  individuals  worldwide  and  predisposes  to  the  growth  of 
both  benign  and  malignant  tumors.  Genetic  factors,  in  addition  to  defects  in  the  NF1  gene  itself, 
clearly  play  a  role  in  tumor  development.  Our  research  is  focused  on  identifying  specific  DNA 
sequences  and  genetic  mechanisms  important  in  the  development  of  cutaneous  neurofibromas, 
which  occur  in  virtually  all  NF1  patients,  and  in  the  development  of  solid  malignancies.  We  have 
analyzed  NF1  microdeletions  that  are  associated  with  an  early  onset,  and  subsequent  heavy 
burden,  of  cutaneous  neurofibromas.  We  determined  the  mechanism  by  which  these  deletions 
arise  and  identified  recombination  hotspots  where  69%  of  NF1  microdeletions.  We  developed 
developed  robust  and  sensitive  assays  to  detect  microdeletions  in  a  patient  blood  sample;  these 
assays  can  be  directly  applied  in  clinical  diagnostic  laboratories.  We  identified  four  NF1REP 
paralogs,  analyzed  their  structure  and  sequence,  and  identified  unique  features  that  may  mediate 
recombination  at  these  sites  and  make  some  chromosome  susceptible  to  NF1  microdeletion.  In 
addition,  we  found  that  the  NF1  microdeletions  in  some  patients  occurred  early  during  embryonic 
development  resulting  in  somatic  mosaicism.  We  developed  quantitative  PCR  assays  to  detect 
novel  NF1  microdeletions  that  occur  either  in  the  germline  or  in  somatic  tissues.  An 
understanding  of  how  NF1  microdeletions  occur,  whether  some  individuals  are  more  susceptible, 
and  why  they  potentiate  the  development  of  neurofibromas  is  important  for  patient  care,  genetic 
counseling,  and  the  design  of  effective  pharmacological  intervention  strategies. 
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Independent  NFI  mutations  in  two  large  families  with 
spinal  neurofibromatosis 

i  Mocciaen  V  Riccardi  J  Peltonen,  O  Maertens,  T  Callens,  S  L  Karvonen,  E-L  Leisti, 
J  Koivunen'  I  Vandenbroucke,  K  Stephens,  M  Poyhonen . 
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The  neurofibromatoses  are  a  group  of  neurocutaneous  dis¬ 
orders  that  show  extreme  clinical  heterogeneity  and  are 
characterised  by  growth  abnormalities  in  tissues  derived 
from  the  embryonic  neural  crest.' 2  Hvo  main  clinical  form 
exist  type  1  (NF1 )  and  type  2  (NF2 ),  as  well  as  several  alter¬ 
nate  and  related  forms.2  ’  NF1  and  NF2  are  the  only  clinically 
well  defined  disorders  and  both  genes  have  been  identifi  . 

The  NIH  diagnostic  criteria  for  NF1,  as  defined  by  t  e  con  er 
ence  statement.’  are  met  if  two  or  more  of  the  foUowmgarc 
found:  six  or  more  CAL  spots;  two  or  more  neurofibromas  of 
any  type  or  one  plexiform  neurofibroma;  axillary  or  ingum 
freckling;  optic  glioma;  two  or  more  Lisch  nodules,  a  dlst“ 
osseous  lesion;  a  first  degree  relative  (parent,  sib,  or  offspring) 
with  NF1  according  to  the  above  criteria. 

Spinal  nerve  sheath  tumours  are  described  as  symptomatic 
findings  in  only  5%  of  NF1  patients,10  although  they  can  be 
observed  by  MRI  in  up  to  36%  of  patients."  '  The  presence  of 
a  wide,  symmetrical  distribution  of  spinal  neurofibrom  , 
occurring  in  all  adult  affected  members  of  the ^same  family 
and  segregating  in  an  autosomal  dommant  fashion 
however  extremely  rare.  This  form,  familial  spmalNF  (FSNF 

has  been  considered  an  alternate  form  of  NF  srnce  patients 
generally  lack  dermal  neurofibromas  and  Lisch  nodules  both 
typical  hallmarks  of  NF1,  and  since  symptomatic  and  genera  - 
ised  spinal  neurofibromas  are  uncommon  m  classical  NFI. 
FSNF  has  been  reported  in  only  four  families^  Three  mu- 
tigenerational  families  with  spinal  neurofibromas  and  CAL 
spots  were  shown  to  be  linked  to  markers  surrounding  the 
NF1  locus.'2  M  "  In  the  fourth  family,  presenting  with  spinal 
neurofibromas  without  CAL  spots,  linkage  to  the  NFi I  locus 
was  excluded."  Only  in  one  FSNF  family  has  the  underlying 
molecular  defect  been  documented  so  far,  which  was  a 
unique  frameshift  mutation  8042insA  in  NFI  exon  46.  Here 
we  describe  the  identification  of  the  ; NF2  « nutation  m  bo* 
remaining  FSNF  families  originally  described  by  Pulst  et  al 
and  Poyhonen  el  al' 2  Our  current  findings  emphasise  that 
FSNF  (with  CAL  macules)  is  caused  by  mutations  in  the  NFI 
gene,  but  does  not  support  the  hypothesis  that  it  is  caused  by 
a  unique  type  of  NFI  mutation. 

MATERIALS  AND  METHODS 

The  pedigrees  of  the  families  studied  are  shown  in  figs  1 A  and 

2  Skin  samples  of  controls  were  obtained  from  operations 
carried  out  for  cosmetic  reasons  from  four  healthy  petsons  at 
the  Department  of  Dermatology,  University  of  Oulu  Finland 
with  the  consent  of  the  Ethical  Committee  of  Oulu  University 
Hospital. 

NFI  mutation  analysis 

Epstein-Barr  virus  (EBV)  lymphoblastoid  cell  cultures  from 
fwo  affected  members  of  family  1  and  fibroblast  cell  cultures 
from  two  affected  members  of  family  2  were  treated  with  and 


Key  points 


Familial  spinal  neurofibromatosis  (FSNF)  is  considered 
to  be  an  alternate  form  of  neurofibromatosis,  with  a  lim¬ 
ited  phenotype  of  multiple  spinal  nerve  root  reurofibro- 
mas  and  cafe  au  lait  (CAL)  macules  in  all  effected 
adults.  Only  three  multigeneration  families  with  FSNF 
have  been  reported  and  in  one  of  them  the  genetic 
defect  was  identified  previously;  8042insA  in  exon  46 

of  the  NFI  gene. 

»  As  NFI  is  notorious  for  its  extreme  phenotypic  variabil¬ 
ity  even  within  the  same  family,  the  striking  homogene¬ 
ity  of  the  phenotype  in  these  families  suggests  that  a 
particular  type  of  mutation  might  underlie  this  condition. 

.  We  identified  the  NFJ  mutation  in  the  remaining  two 
FSNF  families  whose  phenotype  was  known  to  be  linked 
to  chromosome  V7ql  l.2  markers.  In  the  first  family  a 
novel  substitution  at  the  splice  donor  site  of  exon  4V 
(7126+3A>C),  leading  to  skipping  of  exon  39,  was 
found.  In  the  second  family,  a  missense  mutation  135/T 

in  exon  8  was  identified  as  the  sole  .alterahon.  The 

mutation  segregates  with  the  disease  in  both  families. 

•  The  current  findings  emphasise  that  FSNF  (with  CAL 
macules)  is  caused  by  mutations  in  the  NFI  gene,  but 
does  not  support  the  hypothesis  that  it  is  caused  by  a 
specific  type  of  NFI  mutation.  It  is  anticipated  tha 
expression  studies  and  mutation  °natysis  in  the  first 
instance  of  the  genes  tightly  linked  to  the  NF I  locus,  in 
both  families  may  lead  to  the  detection  of  (a)  moditier(s) 
for  this  specific  phenotype. 


without  puromycin  before  total  RNA  extraction  as  described 
previously.'6  DNA  was  extracted  from  EBV  cell  cultures  or 
fibroblasts  from  all  family  members. 

The  optimised  FIT  for  the  entire  coding  region  was  applied 
essentially  as  described  previously  with  abnormal  fragments 
further  analysed  by  cDNA  and  genomic  sequencing.’6  Then, 
for  both  families,  the  entire  NFI  cDNA  was  sequenced  using 
dye-primer  chemistry  on  an  automatic  genetic  analyser 
(ALFexpress).  Information  on  the  sequencing  primers  used  is 
available  upon  request. 

Western  blot  analysis 

Western  blot  analysis,  using  an  anti-NFl  antibody 
(NFIGRD(D))  (Santa  Cruz  Biotechnology  Ir^  Santa  Cruz 
CA)  and  peroxidase  linked  donkey  anti-rabbit  (NA  934) 
(Amersham  International  pic.  Little  Chalfont,  Buckingham¬ 
shire,  England)  as  a  secondary  antibody,  was  earned  out 
essentially  as  previously  described.17 
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baiW+t**  65,39  ■'*' 
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exon  39  is  seen  in  the  patient. 


c.  „Q  ,  pAriiaree  of  family  2  with  mutation  L357P.  In  the 
with  CAL  spots  only,  and  white  symbols  denote  healthy  sublet  . 

RESULTS 

fnbotlf  families  all  affected  adults,  except  for  one  45  year  old 
female  in  family  1  (III.3)  showed  multiple  symmetrically  dis¬ 
puted  spinal  neurofibromas  in  the  cervical  ta«j  « 
lumbar  region  (fig  3).  All  12  affected  members  m  both  faimlie 
‘also  had  more  than  six  (size  over  15  mm)  MBwaCtt 
snots  but  no  iris  Lisch  nodules  were  present.  Cluneal  data, 
originally  described  by  Pulst  et  at"  and  Foyhonen  et  at,  have 
been  updated  and  are  summarised  in  table  1 . 

ffi  faffilyl  PTT  analysis  of  the  NFl  cDNA  of  patients  ffl.2  and 
in.Vencompassing  exons  35  to  49,  showed  in  addition  to  the 


Figure  3  Coronal  T1  weighted  norvconlrast  MR  ™°9es  ]Sjmens 
ns  1  nT  TR  500  TE  15,  slice  thickness  3  mm}  ot 

parents  1  2, 11.2,'  11.3,  and  11.5  (P=proband)  from  family  2,  carrying 
L  mutation  L357P.  (A)  Cervical  spine,  (B)  Thoracic  spine,  (C) 
l  mknr  tn'me  Several  extradural,  intraspinal  neurofibromas 

tumours  ore  marked.  11.2  also  has  a  right  sided  paraspnral  thoracic 
mass  (white  arrow).  The  intro-  and  extraspinal  components  ot  a 
dumb-bell  tumour  in  the  lumbar  spine  (thick  black  arrow)  in  patien 
1U3  are  shown. 

wild  type  68,5  kDa  product  a  truncated  polypeptide  of 
anproxLiately  19  kDa,  suggesting  a  truncating  mutation 
beWecrTnudcotides  6900  and  7200  (fig  IB)  Sequencing  of 
the  KT-PCR  product  showed  absence  of  the  who  e  o  ex°n 
(fig  1C).  By  genomic  DNA  sequencing,  a  novel  substitution, 
7126+3A>C  (fig  1C),  was  found  at  the  splice  donor  site  of 
exon  39  This  is  a  novel  mutation  not  identified  previously  in 
classical  NFl  patients.  We  calculated  the  sphee  ^donor 
strength  from  the  wild  type  sequence  (7126+3A)  and  the 
mutant  sequence  (7126+3C)  using  the :  algorithms  dwdoped 
by  Shapiro  and  Senepathy"  (S&S)  and  usmg  the  Sphce  Site 
Prediction  by  Neural  Network  (SSPNN  http.// 
www  fruit  fly  ora's  eqtools/splice.html )  program.  The  scores 
rSfmSquVnce  (69^  (S&S)  and  0.09  (SSPNN),  were 
significantly  lower  than  the  wild  type  sequence  scores  (79.9 
(S&S)  and  0.82  (SSPNN)).  Hence,  less  efficient  bmdmg  °f 
splice  factors  at  the  mutant  splice  donor  site  wdl  modulate  the 
splicing  efficiency  leading  to  skipping  of  exon  39,  In  order  to 
exclude  the  presence  of  a  putative  second  alteration  in  the  NFl 
gene  the  complete  NFl  cDNA  was  sequenced  using  direct 
cycle  sequencing,  but  no  other  pathogenic  alteration  was 
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PT T  analysis  of  *e  total  coding  8^' sequendng  of  the 
of  a  truncating  mutation.  La  ,  missense  mutation  was 

total  NF1  cDNA  was  performed  and  a #t 
identified  in  exon  8  as  the  sote  ah  ratiomch an, ^  ^  „  g 

amino  acid  357  to  prolm  (  the  mutation  L357P, 

III.l,  IIX-3,  and  III.4  were  shown  to  bearm  ^  ^  ^  ^ 

whereas  the  healthy  fanuly  tbat  the  mutation 

carry  this 

segregates  with  thedlseaS  n  al  chr0mosomes.  L357  is  also 
not  found  by  analysmg  200  «  m  (L26501). 

conserved  in  Fuju  rutope  a  very  rigid 

mouse  (L10370),  and  *  <  creates  a  fixed  kink  in  a  polypep- 

amino  acid  and  its  P^n“  dramatic  consequences  for  the 

sassssst  “ u5,r  “ 

the  pathogenic  N FI  mutation  m  this  family. 

Western  blot  analysis  subjects  and  three 

Four  fibroblast  cell  cultures  independently  from  the 

fibroblast  cell  cultures  ini  *a  tha  missense  mutation 

normal  skin  of  patient ,  d  L  western  transfer  analysis. 

L357P<famUy2  ^igvelsof  If  protein  were  slightly  higher 
This  showed  that  the  levels  oil  v  normal  sub ects 

in  three  out  of  four  fibroblast  cell  fines  tro  not 

compared  to  fibroblasts  cultured  from  patient  1 
shown). 


NFl  because  of  the  exceptional  occmrenao^etrica^  ^  ^ 

neurofibromas  affecting  JP“a  family  accompanied  by 

affected  adult  members  of  the  same  »absenc/of  Usch 

very  mild  cutaneous  s'gns..,  nte(j  families  have  been 

mdnlSTVc£Zts  w«e “emd  Se  sole  cutaneous 
described  and  CAL  spot  l2 ,4  „  ^^ugh  the  clmi- 

sign  present  in  all  ^  S  the  NIH  crite- 

cal  manifestations  m  e  spots,  two  neurofi- 

ria  for  NF1  (presence  of  mote  than  six  CAL^ 

bromas  of  any  type,  in  this  ca  offspring)  with 

and  also  a  first  degree  dermal 

NF1),  they  are  very  in  more  than 

neurofibromas,  and  Lis  classical  NF1  patients  but 

80%,  90%,  and  90%  ted  "members  of 

these  signs  were  generally  absea{fected  member  had  freckles 
the  FSNF  families  stu  ■  dermal  neurofibromas,  but  no 

in  the  axillary  area  and  tw  symptomatic  spinal 

“  «”'L<  in  NP1  patients  «»>•  ft-*™* 


.  mnltinle  spinal  neurofibromas  affectmg  all 
the  presence  of  rauUiple  p  patients  leading  to 

spinal  roots  symmetrically  in  atypical  of  NF1,  a 

severe  neurological  impair  .  .  among  family  mem- 

disease  noted  for  its  variable  ^f^^^ific clinical 
bers  with  identical  ^^  ^^^^pf^Q^rpe'correlation  might 
findings  suggest  that  a  gen  yp  P  .fi  0f  gene 

>»  Ioann  »  «»s  £d»  « 

"i'  b.  peesent  m  to<  — 

“In  one  of  toe  the 

identified  previously,  a  unique  frameshift  at  the 
NF 1  gene  in  exon  46  8042insA^  athogenic  mutation  in 

We  have  now  identified  a  bona  ^  {amily  1#  a 

the  affected  members  of  bo.  ,  (7126+3A>C)  resulting 

splice  donor  mutation  wasf^e”*l39at(^e  mKNA  level.This  is 

in  out  of  frame  skipping  nreviously  in  classical  NF1 

a  novel  mutation  not  identified  p  ev.ously  m  g 

paT4!f<[^ 

unable  to  correlate  this  gen  yp  psnf  with  CAL  spots  is 
The  current  findings  underlme  that  FSNF  wim^  r  ^ 

not  only  lrnked  to  the  region  cieariy  allelic 

and  pEW206,  encompassing  the  NFJjo  ^  ^  ours 

with  NF1 .  ^thermore  takmg  the  data  from  Ar  ^ 

sparse 

classical  NF1.  Moreover,  it  is  unhkrty this 

C^fi^nm^thyth^— nsf^d-^ 

the  gene  8042msA  aSP *“8tly  Kaufmann  et  ar  described 

"fcwhh multiple  sP-al  tumours 

Although  overall  the  cutaneous hafcutaneous  neurofibromas, 
affected  patients,  several  of  bab  ution  was  a  splic- 

Tn  these  families  too,  the  underlying  N  Tt  has 

tZVu5A>G)  and  a  missense  mutation  (L2067P).  It  has 
mg  (IVS31-5A>0)  an  might  arise  through  a 

been  proposed  that  theFSNb  pneno  yp  *  ofibrQmin  with  a 

enco^^red,^ese°mutadonsnwdll 

transcripts  that  become  degra  J  For  ^  mutations 
ENA  decay,  leading  to  baPlo^"“^s  conceivable  that  a 

ne^tiwr«Wualfmicfion'mightstfflexKt,that^,thatamuttnt 

SSSBEI SSZS3KSSBL.*— 
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by  mutation  TO42ta^and^a^2353aminoaad^U'imc  ^  J> 

l£ 

EBV  cell  lines  of  two  Patl““  |  neurofibrofflta  was  found 
antibody  SC67,  but  n°*™dn**ink  that  this  is  enough 
(data  not  shown),  as  the  molecular  tools 

sole  factor 

pi..!., » *'r rs m  ™»£  r  «.  "> 

„  .  nmpto  of  similar  fia  ,„tens  «ere  fornid  M 


S  a  number  of  simttar  f^^ss^s?mutations  were  found  in 
he  NFI  gene,  splicing,  1ti6 »?  s*  /unpublished  data) 
►atients  without  spinal  mVolvement  l  P 

ind  with  classical  may  Vint  to 

accepted  that  bona  fid  _t_-  is  as  thev  may  lead  to 

rritical  functional  domains  m  a  pr°!^RasG^P  activity  of  the 
ihe  production  of  a  mutant  the 

central  GAP  related  r domam  characterised-  and 

GRD  of  neurofibromin  hare  this  domain  have 

the  effects  of  specifemKS^nsemttona,  signiflcance  has  been 
been  studied  m  detan,  ct  tog  exon  g  in  which 

contributed  so  far  to  t  ereg  R  ntly,  the  ncurofibromin 

the  L357P  mutauon  was  found,  Recently.^  ^  ^ 
content  of  cells ;  fr0®  u  was  found  lo  be  reduced  to 

neurofibromas  without  CAL  p  t  j  cells,  suggesting 

half  of  the  amount  of  tingly.  one  of  the  muta- 

functional  haploimuffiaeneyi ^  u(||p  In  order  to  sub- 
tions  tested  was  a  missen  we  have  performed  western 

stantiate  these  findings  f  '  ceR cultures  (n=4) 

blot  analysis  on  control  norm  independently  initiated 

and  fibroblast  cell  cult  (  '  tjon  L357P.  The  levels  of 

from  patient  11,5,  car^g  h  y  three  out  of  four  control 
NFI  protein  were  slightly  higher  m  L357p  cell  lines, 

normal  fibroblast  cell  lines  D57P  ^ 

No  apparent  differences  «"^^w|lh  FSNF,  This 
one  normal  control  colnP“  NFJPeene  is  not  a  housekeeping 
most  probably  is  because  the  NK1  |^nmarked  expresslo„ 
gene  and  the  same  cell  1  ,  «  tftaes  »  However,  we  do 

changes  when  analysed  at  di  »  ^  form  a  definitive 

not  think  that  this  eviden  lt  was  also  proposed  that  the 
conclusion  of  haplmnsuffic  cy^^  ^  ^  mutation, 

FSNF  was  the  result  of  the  pr  more  like(y  on  the 

either  on  the  the  alr]ad v  identified  mutation 

same  allele  that  also  con  kte  cascade  of  techniques 

(in  c/s).”  As  we  applied  P  dasslcal  NFI  patients" 


and  should  help  in  understanding  the  n»k*  alterations 
that  cause  this  severe  neurological  phenotype. 
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nd  did  5  pfcsemconcomltantly. 

S  it  b  P« ssible  «HH°H2o1 

;ene  lymg  within  the  r  g  botb  families  influences 

md EW206 and  shown  lobe  ™“bTe,notvpic  outcome. 

ers  in  both  families  wm  investigation.  It  has  been 

region  of  interest  for  in  ^  ^  close  proximity  to  the 

suggested  that  a  modifier  gen  ^  with  the  NFI  gene 

NFI  gene"  and  that„l“FScrodelefion"  phenotype  with  early 
results  m  the  severe  NF  ^  number  Qf  cutaneous 

onset  of  growth  and  example  with  a  dominant 

neurofibromas.  A  mutation  '*^;walcrest  cells  that 

pSSS  -  .»=  or  .ho 

are  responsible  for  the  N  I  ht  cooperate  with  the 

genes  flanking **  “f  S NF! furthermore,  the  same  type  of 
NFI  mutation  to  result  m  rw  mutation  or  occurring 

alteration,  present  in  tram  wi  causing  spinal  tumours 
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Duplicon-mediated  microdeletions  around  the  NFI  gene  are  frequently  associated  with  a  severe  form  of  neurofibromatosis 
type  I  in  a  subgroup  of  patients  who  show  an  earlier  onset  of  cutaneous  neurofibromas,  dysmorphic  facial  features,  and  lower 
IQ  values.  To  clarify  the  discrepancies  between  published  maps  of  the  NFI  tumor-suppressor  gene  region  as  well  as  the  length 
of  gaps  in  these  assemblies  and  to  validate  the  recently  described  tandem  duplication  of  the  human  NFI  locus,  we  assembled 
a  contiguous  high-density  map  of  BAC  and  PAC  clones  from  different  genomic  libraries.  Although  two  Wl-I 2393- derived 
low-copy  fragments  are  known  to  occur  at  the  proximal  and  distal  boundaries  of  the  1 .5-Mb  segment  that  is  usually  deleted 
in  NFI  microdeletion  patients,  we  identified  an  additional  W/-/2393-re lated  segment  between  the  MGCI306I  and  the  NFI 
gene,  which  appears  to  trigger  interstitial  deletions  of  smaller  size  as  observed  in  two  patients.  Moreover,  we  completed  the 
genomic  organization  and  cDNA  structure  of  all  functional  genes,  CYT0R4 ,  FLJI2735 ,  FLJ22729 ,  CENTA2 ,  MGCI306I ,  NF/, 
OMG,  EV/2B,  EVI2A}  KIAAI82I,  MGCI 1316 ,  HCA66,  KIAA0I60 ,  and  Wl-12393,  from  this  region.  A  comparison  of  the  human 
map  to  the  orthologous  region  on  mouse  chromosome  1 1  revealed  significant  differences  in  the  number  and  arrangement  of 
genes,  indicating  that  many  chromosomal  breaks  with  partial  duplications,  inversions,  and  deletions  occurred  predominantly 


in  the  primate  lineage.  ©  2003  Wiley-Liss,  Inc. 

INTRODUCTION 

Neurofibromatosis  type  I  (NFI)  [MIM  162200]  is 
an  autosomal  dominant  disorder  which  predisposes  to 
benign  and  malignant  tumors.  NFI  is  caused  by  mu¬ 
tations  of  the  gene  encoding  neurofibromin  at 
17qll.2.  The  detailed  and  precise  structure  of  the 
NFI  gene  and  its  flanking  regions  is  important  in  the 
context  of  mutation  analysis  in  NFI  patients  and  is 
essential  to  understand  the  pathogenetic  mechanisms 
underlying  submicroscopic  deletions  at  17qll.2.  In 
5-20%  of  NFI  patients,  a  heterozygous  deletion  is 
found,  which  includes  not  only  the  NFI  gene,  but 
also  further  genes  in  the  adjacent  regions  (Kayes  et 
al.,  1994;  Cnossen  et  a L,  1997;  Valero  et  al.,  1997; 
Rasmussen  et  al.,  1998;  Upadhyaya  et  al.,  1998).  The 
majority  of  these  interstitial  17ql  1.2  microdeletions 
of  about  1.5  Mb  are  caused  by  unequal  recombination 
between  two  highly  homologous  35-kb  spanning  seg¬ 
ments  (located  between  positions  113000  and  148000 
on  R-271K11,  accession  number  AG005562)  of  two 
low-copy  repeats  (LCRs),  which  contain  duplicated 
portions  of  the  KIAA0563- related  gene,  also  called 
WF12393 ,  and  chromosome  19p  13.1- derived  se¬ 
quences  (Lopez  Correa  et  al.,  1999,  2001;  Dorschner 


et  al.,  2000;  Jenne  et  al.,  2000,  2001).  These  LCRs  are 
localized  at  a  distance  of  about  400  kb  proximal  and 
700  kb  distal  to  the  NFI  gene. 

In  addition  to  the  bona  fide  NFI  gene  at  17qll.2, 
several  NFI  pseudogenes  have  been  identified, 
which  are  derived  from  the  proximal  part  of  the 
NFI  gene.  These  NFI- related  sequences  are  lo¬ 
cated  at  2q21  and  in  the  pericentromeric  region  of 
chromosomes  12,  14,  15,  18,  21,  and  22  (Legius  et 
al.,  1992;  Marchuk  et  al.,  1992;  Suzuki  et  al.,  1994; 
Purandare  et  al.,  1995;  Cummings  et  al.,  1996; 
Hulsebos  et  al.,  1996;  Kehrer-Sawatzki  et  al., 


Supported  by:  Deutsche  Forschungsgemeinschaft;  Grant  num¬ 
bers:  JE  194/1-1,  KF0113/1,  HA  1082/16-1,  16-2;  University  of  Mu¬ 
nich;  Grant  number:  SFB469;  Grant  numbers:  DAMD  17-00-1-0542, 
DAMD  17-01-1-0721  awarded  by  the  US  Army  Medical  Research 
Acquisition  Activity,  820  Chandler  St,  Fort  Detrick,  MD  21702- 
5014. 

^Correspondence  to:  Dr.  Hildegard  Kehrer-Sawatzki,  Depart¬ 
ment  of  Human  Genetics,  University  of  Ulm,  Albert-Einstein-Allee 
11,  D-89081  Ulm,  Germany. 

E-mail:  hildegard.kehrer-sawatzki@medizin.uni-ulm.de 
Received  9  October  2002;  Accepted  8  January  2003 
DOI  10.1002/gcc.  10206 
Published  online  10  March  2003  in 
Wiley  InterScience  (www.interscience.wiley.com). 


©  2003  Wiley-Liss,  Inc. 


112 


JENNE  ET  AL 


1997a;  Regnier  et  al.,  1997;  Luijten  et  ah,  2000).  All 
these  findings  suggest  that  pericentromerie  plastic¬ 
ity  and/or  local  chromosomal  instability  have  re¬ 
shaped  the  NF1  locus  and  adjacent  regions  quite 
often  during  evolution  of  primate  genomes. 

The  spread  of  multiple  NF1  pseudogene  frag¬ 
ments  onto  different  chromosomes  and  the  pres¬ 
ence  of  several  KIAA0563-xt\ztt&  sequences  on 
chromosome  arm  17q  have  provoked  further  spec¬ 
ulations  about  additional  local  duplications  of  the 
NF1  region.  Very  recently,  Gervasini  et  al.  (2002) 
provided  hybridization  evidence  for  a  tandem  du¬ 
plication  of  the  NF1  gene  and  its  immediately 
flanking  regions  at  17qlL2,  by  use  of  high-resolu- 
tion  FISH  on  stretched  chromosomes  and  DNA 
fibers.  According  to  their  observations,  the  NF1 
gene  is  not  only  surrounded  by  duplicons  derived 
from  the  WF12393  (KIAA0563- related)  gene  at  a 
distance  of  400  and  700  kb  on  each  side,  but  also 
might  be  organized  as  a  highly  homologous  tandem 
repeat  within  17qll.2.  We  were  puzzled  by  these 
findings,  given  that  NF1- related  sequences  adja¬ 
cent  to  the  bona  fide  NF1  gene  at  17ql  1.2  have  not 
been  detected  in  previous  PCR  and  microsatellite 
analyses  (Jorde  et  ah,  1993;  Messiaen  et  al.,  1993; 
Li  et  al.,  1995;  Purandare  et  al.,  1996;  Valero  et  al., 
1996),  physical  mapping  studies  (Fountain  et  ah, 
1989a, b;  Ledbetter  et  ah,  1989;  Yagle  et  aL,  1990), 
or  cytogenetic  studies  of  constitutional  transloca¬ 
tions  such  as  the  t(17;22)  (q!L2qll.2),  which  dis¬ 
rupts  the  NF1  gene  in  intron  31  (Kehrer-Sawatzki 
et  al,  1997b,  2002). 

Facing  several  controversial  reports  on  the  struc¬ 
ture  of  the  NF1  region  (Lopez  Correa  et  ah,  1999, 
2001;  Dorschner  et  al,  2000;  Jenne  et  al,  2000, 
2001;  Riva  et  al.,  2000,  2002;  Bentivegna  et  al, 
2001;  Gervasini  et  al.,  2002)  and  the  still  existing 
discrepancies  between  the  Celera,  the  NCBI,  and 
the  EMBL  maps  around  the  NF1  locus,  we  re¬ 
examined  all  existing  data  and  performed  fluores¬ 
cence  in  situ  hybridization  (FISH),  bacterial  artifi¬ 
cial  chromosome  (BAG)  end  mapping,  and  marker 
analyses  to  confirm  the  map  of  the  entire  1.5-Mb 
region  in  great  detail.  Moreover,  we  compared  the 
human  NF1  gene  with  the  orthologous  region  in 
the  mouse  and  found  clear  differences  in  the  num¬ 
ber  and  order  of  genes  in  both  species. 

MATERIALS  AND  METHODS 

FISH  and  BLAST  Analysis 

BACs  and  PI  artificial  chromosome  clones 
(PACs)  were  purchased  from  Research  Genetics 
(Huntsville,  AL)  and  from  the  BACPAC  Resource 


Center  (wwv.chori.org/bacpac).  DNA  was  isolated 
from  liquid  cultures  of  150  ml  by  use  of  the  Midi 
DNA  isolation  kit  (Qiagen,  Hilden,  Germany).  Se¬ 
quence  alignments  wrere  determined  by  BLAST 
analysis  against  all  available  databases  (http://www. 
ncbi.nlm.nih.gov/blast)  and  assembled  by  use  of 
the  GCG  software  package  (Version  10.2;  Oxford 
Molecular  Group,  Beaverton,  OR)  and  the  Se~ 
quencher  software  (Gene  Codes,  Ann  Arbor,  MI). 

The  regional  probe  spanning  WF12393  exon  1 
was  amplified  by  PCR  by  use  of  the  Expand  Long- 
Template  PCR  system  (Roche-Diagnostics,  Mann¬ 
heim,  Germany)  with  primers  DJ  1686  (5'-CAG 
GTT  ATA  GGG  AAG  GAG  GAC-30  and  DJ  1863 
(S'-AGC  AGC  GGT  TAA  GCA  ATG  ATG-3T 

The  6-kb  spanning  PCR  product  was  cloned  with 
the  TOPO  TA  cloning  system  (Invitrogen,  Carls¬ 
bad,  CA)  and  used  as  a  FISH  probe.  Metaphase 
spreads  were  prepared  either  from  peripheral  blood 
lymphocytes  or  lymphoblastoid  cell  lines.  Inter¬ 
phase  nuclei  for  genomic  distance  measurements 
were  obtained  from  G1  synchronized  fibroblast  cul¬ 
tures  by  use  of  standard  methods.  BAG  DNA  and 
the  plasmid  containing  the  WI- 12393  exon  1  were 
labeled  by  nick  translation  by  use  of  biotin- 16- 
dUTP  (Roche-Diagnostics)  or  digoxygenin-11- 
dUTP  (Roche-Diagnostics).  Probes  were  either  vi¬ 
sualized  with  FITC-avidin  and  biotinylated  anti- 
avidin  (Vector  Laboratories,  Burlingame,  CA)  or 
detected  by  mouse  anti-digoxygenin  antibodies 
coupled  with  Texas-Red  and  in  a  second  step 
by  anti-mouse  antibodies  conjugated  with  Texas- 
Red  (Dianova,  Hamburg,  Germany).  Slides  were 
counterstained  with  diamidinophenylindole  and 
mounted  with  Vectashield  antifade  solution 
(Vector  Laboratories). 

Patients  With  1 7q  I  1,2  Microdeletions  of  Smaller 
Size 

The  clinical  phenotype  and  the  age  of  two  NF1 
patients  (TOP,  UWA113-1)  with  deletions  smaller 
than  1.5  Mb  (Dorschner  et  al,  2000;  Jenne  et  al, 
2000)  are  summarized  in  Table  1.  This  table  also 
includes  the  clinical  data  of  patient  UW155-1, 
whose  proximal  deletion  breakpoint  was  located 
within  the  interval  of  markers  SHGC35088 
(RH33498)  and  FB12A2  and  the  distal  deletion 
breakpoint  between  markers  D 17$ 1656  and 
st$G50857  teiomeric  to  the  distal  LCR  (Dorschner 
et  al.,  2000). 

BAC  Library  Screening 

Filters  from  the  RPCI-11  human  BAC  library 
and  the  RPCIl,3-5  human  PAG  libraries  were  ob- 
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TABLE  I.  Clinical  Features  of  Patients  With  NF/  Deletions  Smaller  Than  the  Usual  Duplicon-Mediated  Deletion 


Feature 

TOP 

UWA-II3-I 

UWA- 155-1 

Age  (years) 

11.5 

8 

27 

Number  of  cutaneous 
neurofibromas 

45 

TNTCa 

TNTC 

Height  (cm,  percentile) 

147.5  (97%) 

unk.b 

unk. 

Head  circumference 

56  (50%) 

59  (>97%) 

59.5  (>97%) 

Facies 

Coarse,  large  nose 

Coarse,  hypertelorism 

Coarse,  ptosis 

Intelligence 

IQ  59 

IQ  88 

Moderate  MR 

Hands 

97% 

>97% 

>97% 

Feet 

>97% 

>75% 

Other  tumors 

Optic  glioma  suspected 

— 

Spinal  neurofibroma 

No  spinal  MR! 

Malignant  peripheral  nerve 
sheath  tumor  at  age  28 

aTNTC,  too  numerous  to  count. 
bunk.,  unknown. 


tained  from  the  Resource  Center  (www.rzpd.de) 
and  the  BACPAC  Resource  Center  (www.chori. 
org/bacpac).  Screening  of  the  libraries  was  per¬ 
formed  by  hybridization  with  STS  markers  ampli¬ 
fied  by  PCR  and  labeled  by  random  priming  with 
hexamer  oligonucleotides  and  Klenow  polymerase 
in  the  presence  of  j2P-dCTP  (Amersham  Pharma¬ 
cia,  Freiburg,  Germany)  in  buffer  containing  7% 
SDS,  0.5  M  Na-phosphate  (pH  7.2),  and  1  mM 
EDTA  at  65°C  overnight.  Filters  were  washed  four 
times  at  increasing  stringency,  with  a  final  wash  of 
0.2 X  SSC/0.1%  SDS  for  45  min.  After  autoradiog¬ 
raphy  for  1-2  days  at  -70°C  with  intensifying 
screens,  positive  clones  were  identified  by  x/y  co¬ 
ordinates  and  obtained  from  BACPAC  resources  or 
the  Resource  Center. 

Characterization  of  Full-Length  cDNAs 

Full-length  transcripts  of  the  FU 12735, 
MGC13061,  KIAA182 ,  MGC11316,  and  FU11040 
genes  were  determined  by  RT-PCR  with  human 
cDNAs  obtained  by  reverse  transcription  by  use  of 
the  Superscript  II  system  (Life  Technologies, 
Gaithersburg,  MD)  of  RNA  from  various  tissues 
(Total  RNA  panels  I  and  II;  Clontech,  Palo  Alto, 
CA).  The  structure  of  the  RT-PCR  products  was 
analyzed  by  sequencing  by  use  of  the  Big-Dye 
Terminator  kit  (Applied  Biosystems,  Foster  City, 
CA).  The  structure  of  the  KIAA1821  gene  was 
further  confirmed  by  Northern  blot  hybridizations 
by  use  of  the  human  MTN  Blot  (Clontech)  accord¬ 
ing  to  the  manufacturer’s  instructions. 

RESULTS  AND  DISCUSSION 

Notwithstanding  the  enormous  progress  in  con- 
tig  assembly  and  gene  annotation,  it  appears  that 


most  of  the  mapping  problems  of  the  human  NFI 
gene  region  originate  from  ambiguities  and  uncer¬ 
tainties  in  two  genomic  segments  located  between 
the  BACs  R-271K11  and  468F23,  and  on  BAC 
R-640N20.  These  two  regions  are  not  fully  covered 
by  BACs  and  sequence  assemblies  in  online-ge¬ 
nome  databases.  More  important,  several  of  the 
genes  from  the  1.5-Mb  spanning  interval  are  only 
partially  annotated  by  automatic  computer  algo¬ 
rithms  and  are  deduced  from  very  few  experimen¬ 
tal  data. 

To  this  end,  we  extended  our  previous  assem¬ 
blies  and  constructed  a  BAC/PAC  contig  of  high 
clone  density  between  the  proximal  WI-12393-re- 
lated  duplicon  and  the  unique  FLJ11040  gene, 
which  flanks  the  distal  duplicon.  We  reinvestigated 
the  structure  of  the  genes  previously  mapped  to 
the  NFI  gene  region  (Dorschner  et  al.,  2000;  Jenne 
et  al.,  2001)  and  integrated  their  position  into  our 
consensus  contig  (Fig.  1).  Overall,  19  new  BACs 
from  different  libraries  were  identified  and  inte¬ 
grated  into  the  map  by  BLAST  analysis  of  end 
sequences  and  marker  analysis.  BAC  R-229K15 
(AC027793)  bridges  the  region  between  BAC 
468F23  (AC004666)  and  R-14206  (AC079915)  and 
thus  fills  the  sequence  gap  between  468F23  and 
the  5'  promoter  region  of  the  NFI  gene.  Because 
the  entire  BAC  insert  has  now  been  sequenced, 
R-229K15  can  clearly  be  put  at  this  position,  al¬ 
though  it  shares  considerable  homology  with 
R-271K11,  in  particular  WI-12393  pseudogene  ex¬ 
ons  as  outlined  below.  Additionally,  the  meanwhile 
fully  sequenced  BACs  2349P21,  3098J15,  and 
R-785C15  could  be  mapped  unambiguously  in  the 
5'  direction  of  BAC  468F23,  thus  extending  the 
contig  to  the  centromeric  direction  up  to  the  most 
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Figure  I.  Complete  physical  and  transcript  map  around  the  NFi 
locus  between  the  proximal  and  distal  duplicons  confining  various 
fragments  of  the  Wi-12393  gene.  Immediately  5?  to  the  NFI  gene,  a  third, 
Wi-12393  pseudogene  fragment  {medial  duplicon)  was  identified.  Green 
boxes  indicate  die  low-copy  repeats  with  high  homology  to  sequences 
on  chromosome  19  (AC01 I5Q9;  AC0G8569),  The  arrows  above  the  line 
indicate  the  genomic  length  and  transcript  direction  of  the  genes  mapped 
to  this  region.  Functional  genes  are  indicated  in  black;  pseudogene  (ps) 
exons  of  the  SMURF2 ,  die  Wi-12393,  and  the  KIAAGI6G  genes  are  given 
in  red.  Indicated  below  the  line  are  markers  that  anchor  the  fully 
assembled  BAC/PAC  contig.  Each  horizontal  bar  represents  the  length  of 


a  given  BAC  or  PAC  done.  Completely  sequenced  clones  are  marked  by 
bars  in  orange,  whereas  partially  sequenced  clones  are  indicated  by 
dashed  black  lines.  The  contig  was  assembled  by  BLAST  analysis  (http;// 
www.ncbi.nlm.nih.gov/),  FISH  and  marker  analysis  by  FCR.  The  19  newly 
mapped  clones  are  marked  by  asterisks.  Processed  pseudogenes  like 
EEN  and  YZAI  were  omitted.  The  accession  numbers  of  the  fully  se¬ 
quenced  clones  are  AC005562  (271  Kit),  AC127024  (2349P21), 
AC  1 095 16  (785C15),  AC091I77  (3098J15),  AC004666  (468F23), 
AC027793  (229KI5),  AC0799I5  (14206),  AC004222(499I29), 

AC004526  (23013,  14502),  AC003 101  (41C23),  AC007923  (805L22), 
and  AC004253  (542B22). 


proximal  WI-12393- related  sequences.  The  struc¬ 
ture  of  the  region  between  BACs  R-271K11  and 
R-785C15  was  confirmed  by  marker  analysis  of  the 
newly  identified  BACs  738C9,  2250N21,  943L1G, 
and  25Q6D2Q,  and  their  position  is  in  full  harmony 
with  our  map  (Fig.  1). 

In  the  3?  direction  to  the  NFI  gene,  the  com¬ 
pletely  sequenced  BAC  R-805L22  was  clearly 
placed  between  BACs  41C23  and  542B22.  The 
accuracy  of  the  map  V  to  the  NFI  gene  was  further 
corroborated  by  the  identification  of  BAC 
R-772K16,  which  consists  of  fully  sequenced  unor¬ 
dered  segments.  The  ordered  overlap  of  BACs 
41C23  and  R-805L22,  BACs  R-805L22  and 
542B22,  and  BACs  542B22  and  307A16 
(ACG03041)  has  been  confirmed  by  FISH  analysis 
in  interphase  nuclei  with  differentially  labeled 
BAC  probes  (data  not  shown).  We  investigated  the 
genomic  segment  between  BACs  542B22  and 
R-474K4  in  great  detail  because  this  region  is  still 
not  covered  by  a  continuous  sequence.  We  inte¬ 


grated  the  completely  sequenced  BAC  2095E4, 
which  overlaps  with  BAC  R-474K4  and  R-640N20. 
Because  the  continuous  sequence  of  BAC 
R-640N20  (AC0906I6,  version  8)  was  not  available 
in  public  databases,  we  ordered  the  eight  pieces  of 
R-640N2G  in  harmony  with  the  cDNA  sequences 
of  the  KIAA0160  and  WI-12393  transcripts  and  con¬ 
firmed  this  order  by  PCR  and  RT-PCR  analysis. 
The  position  of  six  further  BAC  clones  integrated 
into  the  contig  in  this  particular  region  (R-164M10, 
2517E10,  2283118,  R-95306,  2646J6,  and 

R-951F11)  is  fully  consistent  with  the  assembly  of 
BAC  R-640N20  and  the  map  around  the  distally 
located  W/-/Z?P3-related  sequences. 

In  addition,  we  extracted  and  compiled  the 
mouse  homologs  for  the  human  genes  mapped  to 
the  NFI  gene  region  from  BAC  clones  that  had 
been  fully  sequenced  and  deposited  at  the  EMBL 
or  GenBank  database.  The  structure  and  order  of 
the  orthologous  mouse  loci  around  the  NFI  gene 
are  illustrated  in  Figure  2.  By  comparative  se- 
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Figure  2.  Physical  map  of  the  human  (A)  and  murine  NFI  region  (B) 
and  tentative  NFI  region  in  ancestral  primates  (C),  from  centromere 
(left)  to  the  telomere  (right).  The  arrows  indicate  the  genomic  length 
and  transcript  direction  of  the  genes  mapped  to  the  respective  regions. 
Functional  genes  are  indicated  in  black;  pseudogene  (ps)  exons  are  in 
red.  Marked  by  yellow  boxes  are  the  genes  that  have  been  duplicated 
and  relocated  during  primate  evolution  as  deduced  from  comparison  of 
the  human  and  mouse  NFI  gene  region.  The  MGCI 1316  and  WI-12393 
genes  (marked  by  black  frames)  were  either  inserted  during  primate 
evolution  or  deleted  in  the  rodent  lineage.  The  proposed  structure  of 


quence  analysis  of  the  human  and  mouse  genome, 
we  identified  a  total  of  14  human  genes  between 
the  proximal  and  the  distal  duplicons  and  updated 
the  full-length  transcript  structure  of  five  genes 
(FU 12735,  MGCI 3061,  KIAA182,  MGCI  1316,  and 
FU11040)  (Table  2).  Our  analysis  of  the  fully  as¬ 
sembled  clone  contig  presented  here  disproves  a 
local  duplication  of  the  NFI  gene  at  17qll.2  as 
reported  by  Gervasini  et  al.  (2002).  However,  we 
identified  an  additional  truncated  copy  of  the  WI- 
12393  gene  on  BAC  R-229K15  (AC027793)  and 
located  the  latter  between  the  proximal  duplicon 
and  the  5'UTR  of  the  NFI  gene  (Fig.  1).  Gervasini 
et  al.  (2002)  used  BAC  R-977D19  as  a  FISH  probe 
to  investigate  the  structure  of  the  NFI  gene  region. 
This  BAC  spans  not  only  the  5'  region  of  the  NFI 
gene  but  also  contains  a  third  non-processed  pseu- 


the  NFI  gene  region  in  an  ancestral  primate  (C)  is  characterized  by  the 
duplication  and  relocation  of  seven  genes  to  the  proximal  side  of  the  NFI 
gene.  This  duplication  event  was  followed  by  a  deletion  of  five  genes 
(marked  in  blue)  on  the  distal  side.  Indicated  in  red  are  the  copies  of  the 
KIAA0I60  and  WI-12393  genes  on  the  proximal  side  of  the  NFI  gene, 
which  have  been  partially  lost  during  evolution.  We  assume  that  the 
primordial  WI-12393  gene  was  located  between  MGCI 306 1  and 
FL)  1 1 040  in  the  primate  ancestor,  showing  the  same  transcriptional 
orientation  as  the  present-day  pseudogene  copy  on  BAC  R-271KI  I  (red 
labels  with  frame),  which  spans  exons  8  to  1 2. 


dogene  copy  of  the  WI-12393  gene  (medial  dupli¬ 
con,  Fig.  1).  Lack  of  awareness  of  this  additional 
LCR  may  have  contributed  to  the  misinterpreta¬ 
tion  of  fiber  FISH  experiments  and  the  supposed 
NFI  gene  duplication  (Gervasini  et  al.,  2002). 

The  medial  duplicon  may  also  be  implicated  in 
homologous  recombinations  with  the  distal  W I- 
12393- containing  LCR  for  the  reasons  outlined 
below.  Two  patients,  TOP  (Jenne  et  al.,  2000)  and 
UW113-1  (Dorschner  et  al.,  2000),  have  been  re¬ 
ported  with  deletions  of  shorter  length.  These  de¬ 
letions  were  observed  in  all  peripheral  blood  cells 
and  most  likely  represent  constitutional  germline 
deletions.  Although  their  breakpoints  have  not 
been  mapped  at  the  sequence  level,  our  analyses 
suggest  that  both  the  proximal  and  distal  break¬ 
points  in  these  two  patients  are  located  within 
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TABLE  2.  Summary  of  the  Functional  Genes  Around  the  NFI  Locus  at  I7ql  L2  and 
House  Orthologs  on  Chromosome  1 1  (1  IBS) 


Gene  designation  (marker) 

Accession  number  of  the 
human  cDNA  (length  in 
bp/number  of  exons) 

Accession  number  of  the 
homologous  mouse 
cDNA 

(length  in  bp) 

Features  of  the  encoded  protein 

CYTOR4/CRLF3  (SHGC-37343) 

NM_0 15986  (3320/8) 

NM  J) 1 8776  (2365) 

Cytokine  receptor-like  factor  3 

Fiji 2735 

AJ3146483  (6245/28) 

XM_  126278  (1002) 

Putative  ATP/GTP-binding 
protein 

FQ22729  (SHGC-2390) 

AJ3 14645  (1306/4) 

AK003325  (1260) 

XM_  126322  (1257) 
XMJ09915  (1257) 

CENT2A  (SHGC-34232) 

NM_0 18404 

AJ238994  (1796/1!) 

BC027I65  (1611) 

XMJ  26230  (2289) 

XM_  1 099 1 4  (2289) 

lnositoI(  1 ,3,4,5)tetrakisphosphate 
receptor 

MGC 13061 

AJ4967293  (2467/5) 

NM_028019  (1961) 

Contains  C3HC4  zinc  finger, 
signal  peptide 

NFI 

NM_O0G267  (89597/60) 

LI 0370  (11690) 

XMJ26233  (11681) 

Ras  GTPase  stimulating  protein 

OMG 

XM_083979  (1805/2) 

NMJ)  19409  (1794) 

Oligodendrocyte  myelin 
glycoprotein;  signal  peptide 
toll-like  receptor;  leucin-rich 
repeat  protein 

EVI2B 

NM_006495  (1983/2) 

BC0 17548  (1855) 

XM_  1 26388  (1866) 

Prolin-rich,  transmembrane 
segment 

EVI2A 

NM_0I4210  (1563/2) 

NM_0I0I6I  (1613) 

Signal  peptide  and 
transmembrane  segment 

KIAAI82I  (Wl-9521) 

AJ3 14646*  (8311/15) 

XM_  137757  (3348) 

Similar  to  KIAA0665  (Rab  1 1  - 

Exon  1  missing 

XMJ  10986  (3384) 

family  interacting  protein  3), 
Eferin;  EF  hand;  coiled  coil 

MGCI 1316  (WI-6742) 

AJ2721963  (918/4) 
XM_Q56730 

NM_025556b  (789) 

AK0 10573  (789) 

HCA66 

NM_0 18428  (2032) 

NM„  144826  (2081) 

Hepatocellular  carcinoma- 
associated  antigen  66 

XM_050084  (2021/19) 

XMJ  10000  (3022) 

HAT  (half  a  tetratricopeptide) 
repeat 

KIAAO 1 60/JJAZ I 

NM_Q  15355  (4441/16) 

XMJ  26 195  (3996) 

Contains  C2H2  zinc  finger 
domain 

KIM0563reI  (WI- 12393) 

AJ3 14647  (3017/12) 

_ c 

K/AA0J63-related 

Fiji  1040 

AJ496730*  (2453/19) 

AK0 19059  (3000) 

RIF  (—  rho  in  filopodia); 

mitochrondrial  rho-1,  M1RO-I 
rho-related  GTP  binding 
protein 

*This  work;  updates  and  full-length  cDNAs  are  reported. 

bThe  mouse  homolog  of  the  MGC/ 1316  gene  is  located  on  mouse  chromosome  8A2. 

CA  gene  with  similarity  to  human  KIAA0563ret  is  located  on  mouse  chromosome  I  IDS  (XMJ  37868), 


WI-12393-re\&t£d  sequences  of  the  medial  and  dis¬ 
tal  duplicons  (Fig.  3).  Marker  analysis  of  a  hybrid 
cell  line  carrying  only  the  deleted  chromosome  17 
homolog  of  UW113-1  permits  us  to  narrow  the 
location  of  the  proximal  breakpoint  in  this  patient 
between  marker  FB12A2  and  exon  1  of  the  NFI 
gene.  The  six  exons  (12  and  5-1)  of  the  WI- 12393 
pseudogene  copy  on  BAG  R-229K15  are  dispersed 
between  position  69416  and  86281  (numbering  ac¬ 
cording  to  AC027793)  and  are  arranged  in  the  same 


Figure  3.  At  Schematic  representation  of  the  distribution  of  the 
W 1-12393  exons  and  pseudogene  exons  in  the  proximal*  medial*  and 
distal  duplicons  of  17ql  L2,  The  hybridization  sites  of  the  PCR  product 
amplified  with  primers  DJI 686  and  DJI863  (probe  Dj  1686/1863)  are 
shown  by  green  rectangles.  The  arrows  indicate  the  orientation  of  the 
WI- 12393  exons  and  pseudogene  exons.  The  relative  map  positions  of 
BACs  782A21,  R-977DI9,  and  R-640N2G  are  also  indicated.  FISH 
analysis  of  BACs  782A2I  and  R-640N20  to  metaphase  chromosomes  of 
patient  TOR  resulted  in  reduced  signals  on  the  chromosome  17  with 
the  microdeletion.  B;  FISH  with  probe  Dj  1686/1863  (green  signals)  on 
metaphase  chromosomes  of  NFI  patient  TOP  revealed  signals  on  only 
one  chromosome  17  homolog  (open-headed  arrow).  Co-hybridization 
was  performed  with  the  BAC  clone  R-55A13  (red  signals)  mapping  to 
1 7q24. 
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orientation  as  the  pseudogene  copy  on  R-640N20 
that  recombines  in  the  majority  of  NF1  microde¬ 
letion  patients.  Our  FISH  data  for  patient  TOP 
indicate  that  the  centromeric  breakpoint  is  located 
on  BAG  R-782A21  distal  to  BAC  468F23. 
R-782A21,  which  overlaps  with  the  distal  portion  of 
BAC  R-229K15  up  to  position  65277,  shows  a  sig¬ 
nal  of  reduced  intensity  on  one  chromosome  17 
homolog  of  patient  TOP,  whereas  the  proximally 
flanking  BAC  468F23  yielded  a  signal  of  equal 
intensities  over  both  chromosomes  17.  Moreover, 
FISH  analysis  with  a  sub-regional  probe  spanning 
exon  1  of  the  WI-12393  gene  confirmed  the  dele¬ 
tion  of  this  region  in  patient  TOP  (Fig.  3).  There¬ 
fore  we  assume  that  the  proximal  deletion  break¬ 
point  of  patient  TOP  most  likely  lies  in  the  region 
of  the  WI-12393  pseudogene  exons  2-5  and  12. 
The  distal  breakpoint  of  the  deletion  of  patients 
TOP  and  UW113-1  is  located  in  the  region  of  the 
distal  duplicon,  as  shown  previously  (Dorschner  et 
ah,  2000;  Jenne  et  ah,  2000).  Thus,  we  anticipate 
that  the  IVZ-/25£?-related  sequences  on  BAC 
R-229K15  can  also  serve  as  a  template  for  recom¬ 
binations  with  the  distal  pseudogene  copy  on 
R-640N20. 

Immediately  adjacent  to  the  WI- 12393- derived 
sequences  on  BAC  R-225K19  lies  another  non- 
processed  pseudogene  fragment,  which  is  derived 
from  the  SMURF2  gene.  A  second  incomplete  copy 
of  SMURF 2  is  found  on  BAC  R-271K11  centro¬ 
meric  to  the  WI-12393  pseudogene  copy  of 
R-271K11  (Fig.  1).  The  physical  vicinity  of 
SMURF 2  and  WI-12393  pseudogene  fragments  at 
these  two  sites  suggests  a  discontinuous  duplica¬ 
tion  of  this  genomic  segment  on  the  centromeric 
side  of  the  NF1  gene  during  primate  evolution. 
The  functional  copy  of  the  SMURF 2  gene  encod¬ 
ing  E3  ubiquitin  ligase  (NM_G22739),  however,  is 
not  clustered  with  the  functional  WI- 12393  gene, 
but  maps  to  BAC  R-87N3  (AC009994)  on  17q22. 

Our  detailed  analysis  of  the  continuous  map  be¬ 
tween  BACs  R-271K11  and  R-640N2O  shows  that 
the  NF1  gene  region  is  littered  with  four  groups  of 
low-copy  repeat  sequences  derived  from  W 1-123 9 3 
(proximal,  medial,  and  distal  copies),  SMURF 2, 
KIAA0160 ,  and  the  LEC2  (Lecotmedin  2 )ICD97 
(Leukocyte  antigen  CD97)  intergenic  region  on 
chromosome  19,  as  illustrated  by  Figure  1.  Two 
similar  fragments  of  the  SMURF2  gene  are  found  at 
the  proximal  boundary  of  the  wo  WI-12393  frag¬ 
ments  on  the  centromeric  side  of  the  NF1  gene.  A 
large  90-kh  segment  originating  from  the  inter¬ 
genic  region  between  LEC2  and  CD97  on  chromo¬ 
some  segment  19pl3.1  (Fig.  1,  green  rectangles)  is 


Interspersed  between  the  proximal  WI-12393  (ex¬ 
ons  8-12)  and  the  KIAA0160  pseudogene  copy. 
The  proximal  portion  of  this  segment  between 
115000  and  146200  on  R-271K11  (AC005562)  to¬ 
gether  with  exons  2  to  5  of  the  WI-12393  gene 
reoccurs  on  the  telomedc  side  of  the  NF1  gene  on 
BAC  R-640N2G. 

To  investigate  the  multiple  rearrangements  that 
occurred  during  primate  evolution,  we  performed  a 
comparative  analysis  between  the  human  map 
around  the  NF1  locus  and  the  corresponding 
mouse  syntenic  region  on  chromosome  11  (Fig.  2). 
All  of  the  duplications  and  pseudogene  fragments  . 
described  above  are  not  present  in  the  mouse  ge¬ 
nome,  suggesting  that  the  corresponding  local  re¬ 
arrangements  and  segmental  relocations  occurred 
after  the  divergence  of  rodents  and  humans  about 
SOM  years  ago.  The  long-range  organization  of  the 
orthologous  region  in  the  mouse  appears  to  reflect 
the  primordial  mammalian  organization  and  gene 
order  more  faithfully  (Fig.  2C). 

Guided  by  parsimony  considerations,  we  assume 
that  the  gene  order  (from  centromere  to  telomere) 
NFL  OMG ,  EVI2B,  EVI2A ,  K1AA182L  MGC11316 , 
HCA66 ,  KIAA0160 ,  CYTOR4,  FLU 2735,  FU22729 , 
CENTA2 ,  MGC1306L  WI-12393 ,  and  FLU 1040  rep¬ 
resents  the  ancestral  order  and  covers  a  set  of  15 
functional  genes.  Only  two  events  are  supposed  to 
have  altered  this  gene  ensemble  In  the  rodent  lineage 
during  evolution,  that  is,  the  irreversible  loss  of  Wi- 
12393  and  the  translocation  of  Mgcll316  onto  mouse 
chromosome  8  (Fig.  2B).  By  contrast,  the  primate 
region  has  experienced  a  number  of  duplications, 
local  rearrangements,  and  deletions,  the  largest  of 
which  appears  to  be  the  duplication  and  relocation  of 
the  genomic  segment  including  the  KIAA0160-CY- 
T0R4-FU 12735-FU22729 -CENTA2-MGC13061- 
WI-12393  genes  to  the  centromeric  side  of  the  NF1 
gene  (Fig.  2C).  We  assume  that  the  primordial  WI- 
12393  gene  was  inserted  between  MGC13061  and 
FU 11040  in  the  primate  ancestor  and  kept  the  same 
transcriptional  orientation  as  the  present-day  pseudo¬ 
gene  copy,  which  spans  exons  8  to  12  on  BAC 
R-271K11  (Fig.  2A,  C,  red  labels  boxed).  The  order 
and  transcriptional  orientation  of  the  CYTOR4 , 
FU12735 ,  FU22729 ,  CENT2A ,  and  MGC13Q61 
genes  on  the  5'  side  of  the  human  NF1  locus  are 
consistent  with  the  present-day  arrangement  of  the 
homologous  mouse  genes  on  the  3'  side  of  the  mouse 
Nfl  locus.  Most  probably,  the  duplication  and  reloca¬ 
tion  of  the  seven  genes  to  the  proximal  side  to  the 
NF1  gene  in  an  ancestral  primate  predisposed  these 
regions  to  further  local  modifications,  such  as  the 
inactivation  of  the  K1AA0160  and  WI-12393  genes 
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with  local  inversion  and  duplications  of  the  latter 
forming  the  present-day  proximal  and  medial  dupli- 
cons.  The  90-kb  fragment  from  chromosome  19, 
ranging  from  position  115000  to  203000  (numbering 
according  to  R-271K11,  AC005562),  is  by  far 
co-linear  with  the  5'  upstream  region  of  LEC2 ,  with 
the  exception  of  a  small  WI-12393  insertion  (positions 
146200  to  147900,  exons  5  and  4).  A  subfragment  of 
this  segment  (115000  to  146000,  R-271K11  number¬ 
ing)  followed  by  WI-12393  exons  2  to  5  constitutes 
the  distal  duplicon  telomeric  to  the  functional  WI- 
12393  gene  (green  boxes).  Further  insertions  of 
ALOX12B  (arachidonate  12-lipoxygenase)  fragments, 
SMS  (Smith-Magenis  syndrome)-related  repeats 
(Park  et  al.,  2002),  and  SMURF2  fragments  have 
probably  created  the  complex  pattern  of  the  present- 
day  human  sequence  that  is  covered  by  the  BAC 
R-271K11. 

By  contrast  to  humans,  the  mouse  Nfl  gene  is 
located  in  the  middle  part  of  chromosome  11  (band 
B5)  and  not  in  a  subcentromeric  region.  It  seems 
plausible  to  assume  that  the  multiple  duplications 
and  relocations  in  the  NF1  gene  region  during 
primate  evolution  were  favored  by  the  pericentro- 
meric  location  of  this  region  in  the  primate  ances¬ 
tor.  As  revealed  by  in  silico  analyses,  pericentro- 
meric  and  subtelomeric  regions  show  a  three-  to 
fivefold  increase  in  segmental  duplications  (Bailey 
et  al.,  2001),  which  foster  further  structural  rear¬ 
rangement  as  outlined  recently  (Eichler,  2001; 
Samonte  and  Eichler,  2002). 

Comparative  maps  of  the  human  and  murine 
NF1  region  clearly  highlight  the  close  clustering  of 
multiple  evolutionary  breakpoints  and  pathological 
breakpoints  that  have  been  identified  in  NF1  mi¬ 
crodeletion  patients.  Coincidence  of  evolutionary 
breakpoints  at  the  sites  of  low-copy  repeats  and 
repetitive  pseudogene  fragments  has  also  been  ob¬ 
served  in  the  Williams-Beuren  syndrome  region  at 
human  chromosome  region  7ql  1.23  (Valero  et  al., 
2000). 

Patients  with  segmental  haploidy  for  all  14  genes 
(Lopez  Correa  et  al.,  1999,  2001;  Dorschner  et  al., 
2000;  Jenne  et  al,  2000,  2001)  as  well  as  those  rare 
NF1  patients  (patients  TOP,  UW113-1,  and 
UWA155-1)  Qenne  et  al.,  2000;  Dorschner  et  al., 
2000)  (Table  1)  who  lack  only  the  nine  distal  genes, 
but  are  diploid  for  the  five  proximal  genes  ( CYTOR4 , 
FU 12735,  F LI 2 27 29,  CENTA2 ,  MGC13061),  appear 
to  develop  a  large  number  of  benign  neurofibromas  at 
an  earlier  age  (Table  1).  This  clinical  observation 
leads  us  to  suggest  that  haploidy  for  one  of  the  nine 
distal  genes  may  indeed  predispose  the  patients 
with  microdeletions  to  early  tumor  development.  At 


present,  the  most  intriguing  candidate  with  regard  to 
neurofibroma  growth  acceleration  is  the  KIAA0160 
gene  ( JJAZ1 ,  joined  to  JAZF1).  JJAZ1  has  been 
shown  to  be  disrupted  by  somatically  acquired  trans¬ 
location  t(7;17)(pl5;q21)  in  stroma-derived  endome¬ 
trial  sarcomas  (Koontz  et  al.,  2001).  Our  detailed 
physical  description  of  the  entire  NF1  flanking  region 
clearly  reveals  the  position  of  the  functional 
KIAA0160  ( JJAZ1 )  gene  on  the  distal  side  of  the  NF1 
gene  as  the  fusion  partner  in  t(7;17)  translocations. 
Acquired  alterations  of  its  normal  function  and  ex¬ 
pression  levels  might  contribute  to  enhanced  cellular 
growth  and  transformation  of  Schwann  cells  in  mi¬ 
crodeletion  patients. 

Another  frequently  observed  clinical  feature  of 
NF1  patients  with  17qll.2  microdeletions  are  cog¬ 
nitive  deficits  ranging  from  decreased  IQ  value  to 
mental  retardation  and  developmental  delay  at  a 
moderate,  but  significant  level.  Although  there  is  a 
high  incidence  of  learning  disability  in  NF1  pa¬ 
tients,  mental  retardation  and  cognitive  impair¬ 
ment  are  observed  in  only  4-8%  of  all  NF1  pa¬ 
tients  (North,  1999).  In  view  of  the  high  frequency 
of  mental  impairment  in  NF1  patients  with  large 
deletions,  it  is  tempting  to  speculate  that  one  or  a 
combination  of  the  adjacent  genes  account  for  sub¬ 
tle  developmental  defects  of  the  central  nervous 
system.  All  three  patients  (Table  1)  that  share 
hemizygosity  for  the  nine  distal  genes  are  mentally 
handicapped,  suggesting  that  one  of  these  genes 
plays  a  critical  role  in  normal  brain  function  or 
development. 
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10  INTRODUCTION 

11  Much  of  our  current  understanding  of  tumori genesis 

12  is  founded  on  genetic  studies  of  relatively  rare  individ- 

13  uals  with  inherited  disorders  that  predispose  to  certain 

14  cancers,  e.g.,  retinoblastoma[1]  and  colorectal  tu- 

15  mors.t23]  Such  genetic  studies  are  consistent  with  a 

16  model  whereby  normal  tissues  become  highly  malignant 

17  due  to  successive  mutation  of  multiple  genes  that 

18  dysregulate  cellular  proliferation  and  homeostasis. 

19  Important  classes  of  mutated  genes  include  oncogenes 

20  (positive  growth  regulators),  tumor  suppressors  (nega- 

21  tive  growth  regulators),  and  those  encoding  cell  cycle 

22  regulators,  antiapototic  signals,  and  components  of  the 

23  DNA  replication  and  repair  machinery.  Mutation  during 

24  tumorigenesis  can  occur  at  the  nucleotide  level  as 

25  inactivation  of  a  single  gene  or  at  the  chromosomal  level 

26  as  losses  of  large  segments  or  entire  chromosomes,  as  a 

27  fusion  of  two  different  chromosomal  segments,  or  as  a 

28  high  level  amplification  of  a  segment.141  Screening  nor- 

29  mal  and  tumor  tissues  of  patients  for  common  genetic 

30  alterations  has  been  a  productive  strategy  for  identifying 

31  genes  that  contribute  to  tumor  formation.  This  article 

32  focuses  on  genetic  changes  commonly  associated  with 

33  peripheral  nerve  sheath  tumors  in  individuals  with  the 

34  autosomal  dominant  tumor-prone  disorder  neurofibro- 

35  matosis  1  (NF1). 


Virtually  all  individuals  affected  with  NF1  develop  36 
multiple  peripheral  nerve  sheath  tumors.  The  most  com-  37 
mon  are  neurofibromas,  which  are  benign  tumors  that  38 
can  occur  anywhere  along  the  length  of  epidermal,  der-  39 
mal,  deep  peripheral  (including  dorsal  nerve  roots  in  the  40 
paraspinal  area),  or  cranial  nerves.  They  do  not  occur  in  41 
the  brain  or  the  spinal  cord  proper.[5]  Although  benign,  42 
neurofibromas  can  cause  considerable  morbidity  by,  43 
e.g.,  infiltrating  and  functionally  impairing  normal  tis-  44 
sues,  causing  limb  hypertrophy,  or  masking  an  emerging  45 
malignancy.  Some  types  of  neurofibromas  can  trans-  46 
form  into  malignant  peripheral  nerve  sheath  tumors  47 
(MPNST),  previously  termed  neurofibrosarcoma  or  48 
malignant  schwannoma.  An  estimated  20%-50%  of  49 
MPNSTs  are  associated  with  NF1  disease,1 [6]  and  they  50 
are  a  significant  cause  of  the  decreased  life  expectancy  51 
in  the  NF1  patient  population. [7,8]  Recent  population-  52 
based  longitudinal  studies  detected  an  annual  incidence  53 
of  1.6/1000  and  a  lifetime  risk  of  8%-13%  for  NF1-  54 
associated  MPNST, [9]  which  is  much  higher  than  de-  55 
tected  in  previous  cross-sectional  studies1101  and  over  56 
three  orders  of  magnitude  greater  than  that  of  the  gene-  57 
ral  population  (~0.001%).[n]  Furthermore,  NFl-asso-  58 
ciated  MPNST  were  diagnosed  at  an  earlier  age  than  59 
sporadic  tumors  (26  vs.  62  years,  p  <0.001)  and  asso-  60 
ciated  with  a  poorer  prognosis  than  sporadic  MPNST  61 
(5-year  survival  of  21  vs.  42%,  p  =  0.09).[91  62 
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63  This  article  reviews  the  genetic  abnormalities  that 

64  have  been  identified  at  the  level  of  the  gene*  chro- 

65  mosome,  and  genome  in  NF1 -associated  neurofibromas 

66  and  MPNST.  Identifying  commonly  associated  genetic 

67  alterations  and  the  mechanisms  by  which  they  arise 

68  may  potentially  lead  to  markers  for  tumor  staging,  to 

69  new  research  approaches  to  pathogenesis,  and  to  the 

70  identification  of  gene  targets,  which  in  conjunction 

71  with  NF1,  will  be  useful  for  mouse  models. 


72  THE  MOLECULAR  BASIS  OF  NF1 

73  The  NF1  Is  a  common  autosomal  dominant  dis- 

74  order  that  affects  about  1  in  3500  individuals  world- 

75  wide.  About  30%-50%  of  cases  are  sporadic,  caused 

76  by  de  novo  mutation  in  the  NF1  gene  of  an  individual 

77  without  a  family  history  of  the  disorder.  In  addition 

78  to  predisposing  to  tumorigenesis,  NF1  is  associated 

79  with  characteristic  changes  in  pigmentation  and  can  be 

80  associated  with  a  wide  range  of  other  manifestations 

81  such  as  learning  disabilities  and  bony  abnormalities 

82  (reviewed  by  Refs.  [12-16]).  All  cases  are  caused  by 

83  mutation  of  the  NF1  gene  at  chromosome  17  band 

84  qll.2,  which  contains  60  exons  that  encode  the  2818 

85  amino  acid  protein  called  neurofibromin/17-211  Neuro- 

86  fibromin  is  widely  expressed,  predominantly  in  the 

87  central  nervous  system  and  sensory  neurons  and 

88  Schwann  cells  of  the  peripheral  nervous  system. 122,231 

89  One  functional  domain  of  neurofibromin  has  been 

90  defined,  the  GAP-related  domain  (GRD),  so  called 

91  because  of  Its  structural  and  functional  homology  to 

92  mammalian  pi  20-GAP  (GTPase  activating  protein)  and 

93  yeast  genes  known  to  regulate  the  Ras  pathway/24,251 

94  The  Ras-GAP  proteins  function  as  negative  regulators 

95  of  Ras  by  catalyzing  the  conversion  of  active  GTP- 

96  bound  Ras  to  the  Inactive  GDP-bound  Ras  form/261  In 

97  NF1 -associated  peripheral  nerve  sheath  tumors,  it  is 

98  hypothesized  that  neurofibromin  deficiency  leads  to 

99  Increased  activated  Ras,  resulting  in  aberrant  mitogenic 

100  signaling  and  the  consequent  growth  of  a  tumor.  The 

101  Identification  of  an  NF1  patient  with  a  missense  muta- 

102  tion  in  the  GRD  that  specifically  abolished  the  Ras- 

103  GAP  activity  of  neurofibromin  demonstrated  the 

104  importance  of  neurofibromin  GAP  function  In  NF1 

105  pathogenesis.1271  In  at  least  some  tissues,  there  is  evi- 

106  dence  that  it  is  the  Ras-GAP  activity  that  accounts  for 

107  the  tumor  suppressor  function  of  neurofibromin.  The 

108  most  complete  evidence  comes  from  genetic  and  bio- 

109  chemical  analyses  of  NF1 -associated  malignant  mye- 

110  loid  disorders/28,291  The  tumor  suppressor  function  of 

111  NF1  has  been  reviewed  reeently[30“321  along  with  its 

112  potential  as  a  therapeutic  target/331 
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The  NF1  disease  is  caused  by  haploinsufficiency  for  113 
neurofibromin.  The  vast  majority  of  NF1  mutations  are  114 
not  in  the  GRD  but  are  distributed  throughout  the  gene.  115 
Over  80%  of  mutations  inactivate  or  predict  inactivation  116 
of  neurofibromin;  splicing  defects  and  sequence  altera-  117 
tions  that  create  a  premature  translation  termination  co-  118 
don  are  the  most  common/34,351  About  10%  of  NF1  119 
mutations  are  missense/35,365  which  are  typically  clus-  120 
tered  in  the  GRD  or  an  upstream  cysteine/serine-rich  121 
domain  that  may  play  a  role  in  ATP  binding/361  Defi-  122 
nitive  evidence  that  neurofibromin  haploinsufficiency  123 
underlies  NF1  came  with  the  identification  of  NF1  whole  124 
gene  deletions  in  an  estimated  5% -10%  of  affected  125 
individuals/37,385  Mutational  analyses  of  patients  with  126 
specific  features,  such  as  plexiform  neurofibroma/391  127 
spinal  neurofibroma/401  or  malignant  myeloid  disor-  128 
ders/411  failed  to  detect  any  correlation  between  geno-  129 
type  and  phenotype.  A  notable  exception  is  the  subset  of  130 
patients  heterozygous  for  a  microdeletion  spanning  the  131 
entire  NF1  gene,  who  consistently  show  an  early  age  at  132 
onset  of  cutaneous  neurofibromas  (see  below).  133 

Virtually  all  NF1  patients  develop  neurofibromas.  134 
Neurofibromas  are  comprised  of  an  admixture  of  large-  135 
ly  Schwann  cells  and  fibroblasts,  along  with  mast  cells,  136 
endothelial  cells,  and  pericytes/42,431  Although  classi-  137 
fication  schemes  vary,  Friedman  and  RIccard/131  define  138 
four  types  of  neurofibromas.  Discrete  cutaneous  neuro-  139 
fibromas  of  the  epidermis  or  dermis  are  the  most  com-  140 
mon,  typically  appearing  near  or  at  puberty  and  in-  141 
creasing  in  number  to  over  100  by  the  fourth  decade  of  142 
life/441  They  are  a  localized  tumor  of  small  nerves  in  143 
the  skin  that  feels  fleshy  and  soft;  they  are  more  144 
prevalent  on  the  trunk  but  also  occur  frequently  on  the  145 
face  and  extremities.  Discrete  subcutaneous  neurofi-  146 
bromas  have  a  spherical  or  ovoid  shape,  feel  firm  or  147 
rubbery,  and  may  be  painful  or  tender.  Deep  nodular  148 
neurofibromas,  also  called  nodular  plexiform  neurofi-  149 
bromas,  involve  major  or  minor  nerves  in  tissues  be-  150 
neath  the  dermis.  On  gross  pathology,  they  appear  to  151 
grow  inside  the  peripheral  nerve,  causing  a  fusiform  152 
enlargement,  and  may  extend  the  entire  length  of  a  153 
nerve/431  Diffuse  plexiform  neurofibromas  have  a  cel-  154 
lular  composition  similar  to  that  of  cutaneous  neuro-  155 
fibromas,  but  in  contrast,  they  have  a  tendency  to  be-  156 
come  locally  invasive/5,431  Histologically,  this  tumor  is  157 
a  tangled  network  involving  multiple  nerve  fascicles  or  158 
branches  of  major  nerves  with  poorly  defined  margins  159 
that  makes  complete  surgical  resection  virtually  im-  160 
possible,  Plexiform  neurofibromas  can  be  superficial  161 
with  extensive  involvement  of  underlying  tissues,  or  162 
they  may  involve  deep  tissues,  particularly  in  the  era-  163 
niofacial  region,  paraspinal  structures,  retroperitoneum,  164 
and  gastrointestinal  tract/131  They  can  Infiltrate  soft  165 
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166  tissues  resulting  in  localized  hypertrophy  and  signifi- 

167  cant  functional  impairment.  In  contrast  to  other  types 

168  of  neurofibromas,  diffuse  plexiform  tumors  are  con- 

169  sidered  congenital  because  they  typically  become  evi- 

170  dent  in  infancy  or  childhood  and  rarely,  if  ever,  in  late 

171  adulthood. [45,46]  In  a  population-based  study,  32%  of 

172  individuals  with  NF1  had  a  plexiform  neurofibroma(s) 

173  on  physical  examination/473  Korf*53  recently  reviewed 

174  plexiform  neurofibromas. 

175  Plexiform  neurofibromas,  both  diffuse  and  nodular, 

176  are  at  a  greater  risk  of  transforming  to  an  MPNST 

177  than  other  types  of  neurofibromas  (reviewed  in 

178  Refs.  [5,43,48]).  Pathological  examination  of  MPNST 

179  from  individuals,  both  with  and  without  NF1,  most  often 

180  shows  an  association  with  a  neurofibroma. [  11,49-5 11 

181  These  data  suggest  that  an  early  step  in  MPNST  devel- 

182  opment  may  be  preneoplastic  process  in  the  nerve  sheath. 

183  Multiple  pathological  and  molecular  criteria  are  used  to 

184  evaluate  a  neurofibroma  for  malignant  transforma- 

185  tion/483  Several  lines  of  evidence  are  consistent  with 

186  cutaneous  and  plexiform  neurofibromas  and  MPNSTs 

187  being  clonal  tumors  that  arise  from  an  ancestral  Schwann 

188  cell  (see  below). 


189  GENETICS  OF 

190  NEUROFIBROMAGENESIS 

191  Homozygous  Inactivation  of  NF1 

192  Homozygous  inactivation  of  a  tumor  suppressor 

193  gene(s)  is  a  fundamental  mechanism  of  tumorigenesis.  It 

194  occurs  by  either  sequential  somatic  inactivation  of  both 

195  alleles  or  by  a  somatic  mutation  in  the  single  normal 

196  homolog  in  individuals  who  inherit  a  germline  mutation 

197  in  one  allele.  Somatic  inactivation  is  frequently 

198  associated  with  loss  of  heterozygosity  (LOH)  at  the 

199  tumor  suppressor  locus  and  at  multiple  flanking  loci 

200  (reviewed  in  Refs.  [52,53]).  Evidence  for  such  “2nd 

201  hit”  somatic  NF1  mutations  in  neurofibromas  has  been 

202  sought  in  support  of  the  hypothesis  that  neurofibromin 

203  functions  as  a  tumor  suppressor  in  Schwann  cells.  Initial 

204  reports  of  LOH  at  NF1 [543  have  been  confirmed  and 

205  extended  by  analyses  of  both  primary  tumor  tissue  and 

206  neurofibroma-derived  Schwann  cells.  At  least  25%  of 

207  neurofibromas  undergo  LOH  at  NFL[55'56]  The  cellular 

208  admixture  in  neurofibromas  can  mask  allelic  loss/55,563 

209  which  is  the  likely  explanation  for  reports  that  detected 

210  few,  if  any,  tumors  with  LOH/57-623  Compelling 

211  evidence  that  the  somatic  inactivation  of  the  NF1  gene 

212  itself  is  important  was  provided  by  the  identification  of  a 

213  4  bp  deletion  in  exon  4b  in  a  cutaneous  neurofibroma  of 

214  a  patient  with  a  germline  NF1  microdeletion. [63] 


903 

Subsequent  analyses  of  cDNA  from  neurofibromin-  215 
derived  Schwann  cells  showed  that  19%  of  neurofibro-  216 
mas  carried  somatic  intragenic  mutations,  which  were  217 
typically  mRNA  splicing  defects. [34]  Mutation  frequen-  218 
cy  may  be  underrepresented  due  to  the  difficulty  of  219 
recovering  high-quality  tumor  RNA  and  the  underrep-  220 
resentation  of  mutant  transcripts  observed  in  some  221 
tumors.  The  latter  could  be  attributed  to  mutations  that  222 
induce  nonsense-mediated  decay  or  other  mechanisms  223 
that  affect  mRNA  content  (reviewed  in  Ref.  [64])  or  to  224 
reduced  expression  for  other  reasons/343  and/or  a  low  225 
proportion  of  mutant  to  normal  Schwann  cells  in  some  226 
tumors/65,663  Homozygous  inactivation  of  NF1  also  227 
occurs  in  plexiform  neurofibromas,  where  an  estimated  228 
40%  (n  =  10)  showed  LOH/673  a  result  confirmed  by  229 
subsequent  reports. [57,58,68,693  230 

A  predominant  mechanism  of  somatic  NF1  inac-  231 
tivation  in  neurofibromas  that  underwent  LOH  was  232 
mitotic  recombination/563  A  17q  proximal  single  mito-  233 
tic  recombination  event  near  the  centromere  of  the  q  234 
arm  between  the  normal  chromosome  17  and  the  ho-  235 
molog  carrying  the  germline  NF1  mutation  can  gene-  236 
rate  a  cell  in  which  both  NF1  genes  carry  the  germline  237 
mutation  and  all  loci  distal  to  the  recombination  site  238 
are  identical.  Less  common  were  double  recombination  239 
events  that  result  in  chromosome  17  interstitial  loci  240 
showing  LOH.  The  NF1  mRNA  in  some  neurofibromas  241 
is  edited  such  that  an  arginine  codon  is  changed  to  a  242 
nonsense  codon/703  Although  only  one-third  of  neuro-  243 
fibromas  examined  showed  a  low  level  (<2.5%)  of  244 
edited  NF1  transcripts,1713  such  modulation  of  neurofi-  245 
bromin  expression  may  be  important  if,  e.g.,  editing  246 
occurred  at  high  frequency  in  transcripts  from  a  spe-  247 
cific  minor  cellular  component  of  the  tumor.  In  other  248 
neurofibromas,  somatic  mutations  appear  to  destabilize  249 
NF1  mRNA/663  Transcriptional  silencing  via  hyper-  250 
methylation  of  promoter  regions,  a  prominent  mecha-  251 
nism  of  inactivating  other  tumor  suppressor  genes/723  252 
has  not  been  detected  in  neurofibroma  tissue/73,743  As  253 
expected  from  the  mutational  and  LOH  analyses,  some  254 
neurofibromas  had  no  detectable  NF1  transcripts  or  255 
neurofibromin/75,763  A  quantitative  Ras  activity  assay  256 
demonstrated  that  activated  Ras-GTP  levels  were  about  257 
fourfold  higher  in  neurofibromas  than  levels  in  non-  258 
NF1 -associated  schwannomas/75,773  2  59 

Neurofibromas  are  most  likely  clonal  tumors  de-  260 
rived  from  a  Schwann  cell  progenitor.  The  detection  of  261 
LOH  in  a  tumor  operationally  defines  it  as  being  clonal  in  262 
origin,  and  these  data  are  consistent  with  direct  marker  263 
analyses  in  neurofibromas/57,613  The  LOH  or  other  264 
somatic  mechanisms  that  inactivate  NF1  in  a  Schwann  265 
cell  progenitor  may  be  an  early  or  initiating  genetic  event  266 
in  neurofibromagenesis.  Klewe  and  colleagues  showed  267 
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268  LOH  in  primary  neurofibroma  tissue  and  in  tumor- 

269  derived  Schwann  cells  but  not  fibroblasts/781  Other  in- 

270  vestigators  confirmed  this  observation/66,681  Two  genet- 

271  ically  distinct  Schwann  cell  populations,  NFl+/~  and 

272  NFl~f~,  were  successfully  cultured  from  10  mutation- 

273  characterized  neurofibromas,  whereas  tumor-derived 

274  fibroblasts  carried  only  the  germline  NF1*^~  geno- 

275  type/651  Sherman  et  al/791  used  an  elegant  single  cell 

276  Ras-GTP  assay  to  show  elevated  levels  in  neurofibroma- 

277  derived  Schwann  cells  but  not  fibroblasts.  Consistent 

278  with  two  Schwann  cell  populations,  only  a  fraction 

279  ( 1 2%  -  62%)  of  neurofibroma-derived  Schwann  cells  had 

280  elevated  Ras-GTP  levels.  The  basis  of  the  Schwann  cell 

281  heterogeneity  is  not  known,  but  the  authors  speculate  that 

282  the  cells  with  high  Ras-GTP  (presumably  neurofibromin- 

283  deficient)  may  recruit  Schwann  cells  with  lower  Ras- 

284  GTP  levels  (presumably  the  constitutional  neurofibro- 

285  min-haploinsufficient  cells)  via  the  synthesis  of  growth 

286  factors.  Whether  this  admixture  of  Schwann  cells  is 

287  important  in  tumorigenesis  remains  to  be  determined; 

288  however,  its  observation  in  primary  neurofibroma  tissues 

289  makes  it  more  likely.  Fluorescence  in  situ  hybridization 

290  and  immunohistochemistry  demonstrated  that  S-100 

291  protein  (Schwann  cell  marker)  immunopositive  cells  in 

292  sections  of  four  of  seven  primary  plexiform  neurofibro- 

293  mas  were  monosomic  for  WFi/803  Because  other 

294  cells  types  were  disomic  at  i¥Fl,  these  results  strongly 

295  implicate  the  Schwann  cell  as  the  target  of  the  NF1  2nd 

296  hit  mutation  and  verify  that  the  results  from  neurofibro- 

297  ma-derived  Schwann  cells  were  not  biased  by  cell 

298  culture.  These  data  also  support  Schwann  cell  genetic 

299  heterogeneity,  because  the  fraction  of  S-100  protein 

300  positive  cells  showing  NF1  deletion  ranged  from  50%  - 

301  93%,  Further  evidence  for  the  importance  of  Schwann 

302  cells  in  neurofibromagenesis  comes  from  the  tumor- 

303  igenic  properties  exhibited  by  neurofibroma-derived 

304  cells.181  “841  In  the  most  comprehensive  studies,  Muir 

305  and  colleagues18'5*845  showed  that  neurofibromin-defi- 

306  dent  Schwann  cells,  derived  from  either  dermal  or 

307  plexiform  neurofibromas,  had  high  invasive  potential 

308  and  produce  neurofibroma-like  tumors  when  engrafted 

309  into  peripheral  nerves  of  scid  mice. 

310  Germline  Alterations  That  May 

311  Modify  Neurofibromagenesis 

312  The  considerable  variable  expressivity  of  NF1 

313  disease  among  family  members  with  presumably  the 

314  same  NF1  mutation  and  results  of  a  statistical  trait 

315  analysis  led  to  the  contention  that  variation  in  an 

316  individual’s  genetic  background  modified  the  NF1 

317  phenotype.146,851  To  date,  no  such  germline  modifying 


genes  have  been  identified.  However,  the  study  of  318 
patients  carrying  NF1  microdeletions  has  provided  319 
compelling  evidence  for  a  gene  that  modifies  neurofi-  320 
bromagenesis.  We  first  recognized  that  NF1  micro-  321 
deletion  patients  typically  showed  an  early  age  at  onset  322 
of  localized  cutaneous  neurofibromas  or,  in  cases  in  323 
which  age  at  onset  could  not  be  documented,  they  had  324 
significantly  greater  numbers  of  cutaneous  neurofibro-  325 
mas  relative  to  their  age.138,861  It  is  common  to  observe  326 
multiple  cutaneous  neurofibromas  during  physical  327 
examination  of  children  under  age  10  that  are  328 
heterozygous  for  an  NF1  microdeletion,138,87,881  where-  329 
as  only  about  10%  in  the  general  NF1  population  have  330 
a  tumor(s)  by  that  age/4/]  In  our  most  severe  ease,  a  5  331 

year  old  with  an  NF1  microdeletion  had  51-100  332 
tumors/381  The  NF1  microdeletions  can  be  inherited  333 
from  an  affected  parent,  and  in  these  cases,  the  micro-  334 
deletion  cosegregated  with  the  early  age  at  onset  of  335 
neurofibromas/88,891  Other  investigators  have  con-  336 
firmed  the  early  onset  and  heavy  burden  of  cutaneous  337 
neurofibromas  in  NF1  microdeletion  patients,  and  over  338 
50  such  patients  have  been  identified/90”981  339 

Most  NFl  microdeletions  span  1.5  MB  of  DNA/871  340 

which  harbors  the  entire  350  kb  NFl  gene  and  at  least  341 
11  additional  genes,  most  of  unknown  function/87,991  342 
The  microdeletions  are  preferentially  maternal  in  ori-  343 
gin[S7,9l,93tl00,i011  and  arise  by  homologous  recombina-  344 
tion  between  60  kb  misaligned  paralogs  (termed  345 

NF1REP)  that  flank  the  NFl  gene/871  Paralogs  are  346 

nonallelic  DNA  sequences  with  a  high  degree  of  identity  347 

that  arose  via  duplication,  e.g.,  the  two  functional  ex  348 
globin  genes.  Over  25  human  disorders  are  known  to  be  349 
caused  by  gene  or  chromosomal  rearrangements  medi-  350 
ated  by  homologous  recombination  between  paralogs,  a  351 
process  correctly  referred  to  as  nonallelic  homologous  352 
recombination  in  the  review  by  Stankiewicz  and  353 

Lupski/1021  Here,  I  propose  that  this  process  be  called  354 
by  the  less  awkward  term  of  paralogous  recombination  355 
and  that  disorders  arising  by  this  mechanism  be  called  356 
paralogous  recombination  disorders,  rather  than  the  cur-  357 
rently  used  genomic  disorders.  Although  the  NFlREPs  358 
share  >95%  sequence  identity  over  60  kb  of  sequence,  359 
about  80%  of  the  NFl  deletion  alleles  are  virtually  360 
identical  because  their  breakpoints  map  to  one  of  either  361 
two  recombination  hotspots  of  several  kb  in  size  located  362 
within  the  NF1REP[1031  (M,  Dorschner  et  aL,  in  pre-  363 
paration).  Although  the  molecular  basis  for  these  364 
hotspots  is  not  yet  known,  their  existence  means  that  365 
the  majority  of  microdeletion  patients  will  be  deleted  366 
for  the  same  set  of  genes.  The  early  age  at  onset  of  367 
cutaneous  neurofibromas  observed  in  several  pa-  368 
tients  wdlh  larger  deletions  and/or  different  breakpoints  369 
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370  indicates  that  generation  of  a  deletion-specific  fusion 

371  gene  product  is  unlikely.*871 

372  We  hypothesized  that  the  codeletion  of  NF1  and 

373  an  unknown  linked  gene  potentiates  cutaneous  neurofi- 

374  bromagenesis.*38’871  What  might  be  the  function  of  the 

375  putative  neurofibromagenesis-potentiating  locus  ( NPL)1 

376  Here  I  propose  two  models  for  the  early  age  at  onset 

377  of  cutaneous  neurofibromas  in  microdeletion  patients. 

378  Haploinsufficiency  for  NPL  could  increase  the  fre- 

379  quency  of  somatic  2nd  hit  mutations  in  the  NF1  gene. 

380  This  could  result  from  a  genomic  instability  in 

381  microdeletion  patients,  which  is  intriguing  in  view  of 

382  reports  detecting  cytogenetic  abnormalities  and  micro- 

383  satellite  instability  in  some  neurofibromas  (see  below). 

384  It  would  be  interesting  to  determine  if  somatic  NF1 

385  mutations  in  cutaneous  neurofibromas  of  microdeletion 

386  patients  occur  by  a  predominant  mechanism.  Only  a 

387  single  tumor  from  each  of  two  deletion  patients  have 

388  been  analyzed;  one  had  a  4  bp  intragenic  deletion[63] 

389  and  the  other  a  splice  site.*651  A  second  model  for  early 

390  onset  of  cutaneous  neurofibromas  proposes  that  NPL 

391  haploinsufficiency  increases  the  probability  that  a 

392  neurofibromin-deficient  progenitor  cell  proliferates 

393  and  manifests  as  a  neurofibroma.  Multiple  mechanisms 

394  could  be  proposed.  For  example,  NPL  could  encode 

395  (or  regulate)  a  cytokine,  cell  cycle  regulator,  tumor 

396  suppressor,  or  oncogene,  which  exerts  a  positive  pro- 

397  liferative  advantage  on  the  progenitor  cell.  Because  of 

398  the  cellular  heterogeneity  of  neurofibromas,  this  model 

399  does  not  necessarily  require  that  the  abnormal  expres- 

400  sion  of  NPL  or  its  putative  downstream  targets  occur  in 

401  the  neurofibromin-deficient  Schwann  clone. 

402  Preliminary,  but  intriguing  evidence  suggests  that 

403  genetic  background,  other  than  NF1  microdeletion, 

404  may  influence  the  somatic  inactivation  of  NFL  In 

405  patients  with  multiple  neurofibromas,  it  was  deter- 

406  mined  that  each  of  the  tumors  showed  the  same  type  of 

407  somatic  mutation  event  (e.g.,  LOH  of  the  entire  q  arm 

408  or  interstitial  LOH).*561  Depending  on  the  extent  of 

409  LOH  and  the  particular  genes  involved,  this  could 

410  explain  differences  in  the  age  at  onset  and/or  numbers 

411  of  neurofibromas  that  develop  in  an  individual. 

412  Somatic  Alterations  That  May 

413  Modify  Neurofibromagesis 

414  The  NFl-associated  neurofibromas  have  been 

415  analyzed  for  genetic  abnormalities  at  the  chromosomal 

416  level  by  comparative  genome  hybridization  (CGH)  and 

417  cytogenetic  analyses.  Comparative  genome  hybridiza- 

418  tion  is  a  powerful  technique  to  detect  and  map  chromo- 

419  somal  regions  with  copy  number  imbalances  in  tumor 
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specimens  (reviewed  in  Ref.  [104]).  Only  two  of  eight  420 
neurofibromas  (type  not  specified)  examined  showed  421 
chromosomal  imbalances;  one  tumor  showed  three  422 

gains;  the  other  only  a  single  gain.*105,1061  This  obser-  423 
vation  is  consistent  with  cytogenetic  studies.  Although  424 
Schwann  cell  cultures  from  dermal  neurofibromas  had  425 
normal  karyotypes,  cells  derived  from  plexiform  neuro-  426 
fibromas  had  abnormalities,  which  in  some  tumors  427 
consisted  of  unrelated  non-clonal  abnormalities.*1071  428 
One  plexiform  had  structural  abnormalities  predomi-  429 
nantly  involving  telomeres,  which  are  typically  associ-  430 
ated  with  genomic  instability  in  other  syndromes/  431 

tumors.*1071  Wallace  et  al.  proposed  that  chromosomal  432 
abnormalities  might  be  important  in  the  development  433 
of  plexiform  neurofibromas.  Chromosomal  abnormali-  434 
ties  in  plexiform  neurofibromas  may  account,  at  least  435 

in  part,  for  their  increased  risk  of  malignant  transfor-  436 

mation.  Whether  other  cellular  components  of  neurofi-  437 
bromas  show  cytogenetic  abnormalities  is  unclear.  438 
Some  neurofibroma  fibroblast-like  derived  cultures  439 
were  reported  to  show  an  increased  frequency  of  chro-  440 
mosomal  aberrations,*1081  whereas  others  were  typically  441 
negative.*1091  442 

Conflicting  data  have  been  reported  regarding  the  443 
presence  of  microsatellite  instability  in  NFl-associated  444 
neurofibromas.  Some  human  tumors,  most  notably  445 
those  of  patients  with  hereditary  nonpolyposis  colon  446 
cancer  HNPCC,  show  microsatellite  instability,  which  447 
is  detected  by  random  changes  in  the  length  of  448 
microsatellite  (simple  nucleotide  repeats)  loci.  Length  449 
mutations  at  multiple  microsatellite  loci  in  a  tumor  450 
reflect  a  genome- wide  instability,  which  in  the  case  of  451 
HNPCC  is  due  to  a  defect  in  any  of  several  mismatch  452 

repair  genes  (reviewed  in  Ref.  [2]).  Ottini  et  al.  [n°]  453 

reported  that  50%  (n  =  16)  of  neurofibromas  showed  454 

altered  allele  lengths  compared  with  matched  normal  455 

tissue,  and  instability  at  chromosome  9  loci  has  also  456 

been  reported.*1111  Birindelli  et  al.*112]  observed  457 
instability  in  a  primary  MPNST  and  a  metastasis  in  458 
one  of  25  cases.  However,  no  evidence  of  instability  459 

was  detected  in  two  subsequent  studies  of  80  neuro-  460 

fibromas,  of  which  5%  appear  to  be  of  the  plexiform  461 
type.*55,731  This  disparity  may  be  due  to  technical  dif-  462 
ferences,  the  number  and  type  of  microsatellite  loci  463 

examined,  and/or  the  stage  of  the  neurofibromas.  The  464 

LOH  at  NF1  was  not  a  factor,  because  all  three  studies  465 

analyzed  neurofibromas  that  were  both  positive  and  466 
negative.  Of  the  eight  neurofibromas  with  microsatel-  467 
lite  instability,  seven  were  unstable  at  only  one  of  the  468 
five  loci  tested.*1101  Due  to  the  important  implications  469 
that  microsatellite  instability  would  have  for  neurofi-  470 
bromagenesis,  additional  loci  should  be  examined  in  471 
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472  these  tumors,  including  those  analyzed  by  the  other 

473  investigators.  In  addition,  it  would  be  interesting  to 

474  know  if  the  neurofibromas  that  showed  microsatellite 

475  instability  were  of  the  larger,  central  piexiform  type,  in 

476  which  case  they  may  have  been  transforming  to  malig- 

477  nancy.  The  proportion  of  loci  that  display  instability 

478  is  known  to  increase  with  tumor  progression  in 

479  HNPCC.1"31 


480  GENETICS  OF  MPNST 

481  DEVELOPMENT 

482  Homozygous  Inactivation  at  NF1 

483  The  homozygous  inactivation  of  NF1  in  NF1- 

484  associated  MPNST,  first  reported  by  Skuse  et  al.,[6°3 

485  has  been  confirmed  in  about  50%  of  tumors  (n  =  22)  by 

486  LOH/67’109’112,114’1153  ]^4utatl0ns  in  both  NF1  alleles 

487  have  been  identified  in  a  single  MPNST,11143  Although 

488  the  mechanism  of  LOH  is  not  known,  it  does  not 

489  generally  involve  cytogenetically  detectable  losses  at 

490  17ql.11 163  Furthermore,  NF1 -associated  MPNST-derived 

491  cell  lines  showed  decreased  or  absent  neurofibromin  and 

492  high  levels  of  active  Ras-GTP,[1 17,1 18]  and  a  quantitative 

493  Ras  activity  assay  demonstrated  that  activated  Ras-GTP 

494  levels  in  tumors  were  about  15-fold  higher  than  levels  in 

495  non-NFl  -associated  schwannomas.17 73  Because  homo- 

496  zygous  inactivation  of  NF1  occurs  in  benign  neurofi- 

497  bromas,  it  is  considered  an  early  or  initiating  event  that  is 

498  necessary  and  sufficient  for  neurofibromagenesis  but  not 

499  MPNST  development.  Malignant  transformation  is 

500  presumably  driven  by  predisposing  genetic  factors  in 

501  the  germline  and/or  by  additional  somatic  mutations  and 

502  positive  growth  selection  in  a  malignant  precursor  cell. 

503  The  role  of  NF1  in  the  development  of  sporadic  MPNST 

504  Is  not  clear;  only  about  10%  of  these  tumors  show  LOH 

505  at  NFL1 1121 

506  Germline  Alterations  That  May 

507  Modify  MPNST  Development 

508  Little  is  known  about  germline  genetic  modifiers 

509  that  predispose  to  MPNST.  Early  speculation  that 

510  patients  with  NF1  microdeletion  may  be  at  increased 

511  risk  for  malignancy [3S1  is  now  supported  by  indirect 

512  evidence.  Mutational  analysis  of  germline  DNA  from 

513  seven  patients  who  developed  MPNST  determined  that 

514  three  (42%)  were  heterozygous  for  an  NF1  microdele- 

515  tion/1193  In  another  study,  2  of  17  (1 1%)  unrelated  NF1 

516  microdeletion  patients  developed  MPNST,  and  affected 

517  first-degree  relatives  of  two  microdeletion  patients  had 


other  malignancies.1873  Although  additional  cases  are 
needed,  these  data  suggest  that  the  lifetime  risk  of 
MPNST  in  microdeletion  patients  may  exceed  the 
already  high  10%  in  the  general  NF1  population.193  If 
so,  the  underlying  mechanism  may  be  essentially  the 
same  as  that  proposed  above  for  early  onset  neurofi¬ 
bromagenesis.  Deletion  of  the  putative  NPL  gene  could 
result  In  genomic  instability  or  exert  positive  growth 
selection  for  the  malignant  clone.  If  microdeletions 
do  predispose  to  both  cutaneous  neurofibromas  and 
MPNST,  it  seems  reasonable  to  speculate  that  in  at  least 
this  subset  of  patients,  either  the  discrete  cutaneous 
type  of  neurofibroma  is  at  increased  risk  of  malignant 
transformation  or  the  frequency  of  nodular  or  diffuse 
piexiform  neurofibromas  is  high. 

Evidence  from  two  families  suggests  the  intriguing 
possibility  that  MLH1  deficiency  predisposes  to  NF1 
and  early  onset  extracolonic  tumors/120’1213  Germline 
heterozygous  inactivating  mutations  in  MLH1  cause 
inefficient  DNA  mismatch  repair,  with  the  consequent 
increase  in  mutation  frequency  and  susceptibility  to 
hereditary  nonpolyposis  colorectal  cancer  (reviewed  in 
Ref,  [122]),  Two  rare  and  independent  cases  of  con¬ 
sanguineous  marriages  between  MLHl  heterozygous 
first  cousins  each  produced  two  children  with  NF1 
disease  and  hematological  malignancies/120, 12  ‘ 3  The 
parents  had  no  signs  of  NF1  and  there  was  no  family 
history  of  the  disease.  The  MLHl  mutations  were 
identified,  confirming  homozygosity  in  three  of  the 
four  deceased  children,  who  presented  with  mutiple 
cafe  au  lait  spots  (4/4),  dermal  neurofibromas  (2/4), 
tibial  pseudoarthrosis  (1/4),  non-Hodgkin’s  lymphoma 
(2/4),  myeloid  leukemia  (2/4),  and  medulloblastoma 
(1/4).  The  authors  suggest  that  these  patients  had  a  de 
novo  postzygotic  NF1  mutation  and  that  the  NF1  gene 
may  be  preferentially  susceptible  to  mismatch  repair 
deficiency.  Unfortunately,  the  NF1  gene  was  not 
analyzed  for  mutations,  due  in  part  to  lack  of  patient 
tissue,*-1203  which  could  have  confirmed  mutation  of 
NF1  and  differentiated  between  postzygotic  mutation 
and  germline  mosaicism  In  a  parent. 

Somatic  Alterations  in 
NFI-Assoclated  MPNST 

All  NF1 -associated  MPNST  showed  significant 
chromosomal  imbalances  by  CGH/105,106’123’1241  Anal¬ 
ysis  of  27  total  tumors  from  19  NF1  patients  showed  an 
average  of  >7  imbalances  per  MPNST  (Table  1).  One 
tumor  had  only  a  single  imbalance,  a  gain  of  chro¬ 
mosome  8/1233  The  studies  led  by  Mechtersheimer  and 
Schmidt/105’1063  which  Include  74%  of  tumors  exam¬ 
ined,  detected  chromosomal  gains  more  frequently  than 
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Tl.l 

T1.2 

T1.3 

Table  1. 

Chromosomal  gains  and  losses  in  MPNST  detected  by  CGH. 

NFl-associated  MPNST 

Sporadic  MPNST 

Loethea 

Mechtersheimerb 

Schmidt0 

Loethe3 

Mechtersheimerb 

Schmidt' 

T1.4 

No.  tumors/No.  patients- 

7/7 

6/6 

14/6 

3/3 

13/13 

22/20 

T1.5 

Imbalances 

52 

77 

188 

14 

176 

200 

T1.6 

Per  tumor 

7.4 

11.7 

13.4 

4.6 

13.5 

9.1 

T1.7 

Range 

1-17 

6-30 

7-29 

1-7 

0-34 

0-25 

T1.8 

Chromosome  gains 

14 

48 

139 

4 

125 

179 

T1.9 

Per  tumor 

2 

8.0 

10.0 

1.3 

9.6 

8.1 

T1.10 

Range 

0-3 

4-18 

5-20 

0-2 

0-23 

0-21 

Tl.ll 

Chromosome  losses 

38 

29 

49 

10 

51 

33 

T1.12 

Per  tumor 

5.4 

4.8 

3.5 

3.3 

3.9 

1.5 

T1.13 

Range 

0-14 

1-12 

0-11 

1-4 

0-11 

0-9 

a:  Ref.  [123]. 
b:  Ref.  [105]. 
c:  Ref.  [106]. 
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569  losses  (Table  1).  Each  of  the  20  tumors  in  these  studies 

570  had  >4  chromosomal  gains.  Although  the  results  of 

571  these  two  studies  were  comparable,  they  differed 

572  from  a  third  study  that  found  chromosomal  losses 

573  more  prevalent  than  gains.*1231  The  reason  for  the  dis- 

574  parity  is  unclear;  however  it  was  also  evident  in  the 

575  analyses  of  sporadic  MPNST  in  each  study  (Table  1). 
T2  576  Table  2  summarizes  the  chromosomal  segments  that 

577  most  frequently  showed  gains  in  NFl-associated 

578  MPNST.  The  most  common  segments  were  on  17q22- 

579  q24,  17q25,  7pl4,  7p21,  8q22,  8q23-q24,  and  7q31. 

580  Chromosomal  loss  of  these  segments  was  rarely,  if  ever, 


observed.*105,106,1233  A  combined  analysis  of  sporadic 
and  NFl-associated  MPNST  revealed  a  significantly 
decreased  survival  rate  of  patients  with  MPNST  with 
gains  at  both  7pl5-21  and  17q22-qter.*124] 

In  addition  to  chromosomal  gains,  CGH  analyses 
also  revealed  large-scale  chromosomal  amplifica- 
tions.[105ll06J24]  One-third  of  MPNST  of  both  NF1  and 
non-NFl  patients  had  at  least  one  amplified  chromo¬ 
somal  segment  (Table  3).  Although  the  number  of 
tumors  is  small,  there  are  differences  in  amplification 
patterns.  In  NFl-associated  MPNST  7p  and  17q22-qter, 
the  same  regions  that  commonly  showed  chromosomal 


T2.1 

Table  2.  Frequency  of  chromosomal  gains  in  NFl-associated  MPNST. 

T2.2 

%  MPNST 

T2.3 

Loethea 

Mechtersheimerb 

Schmidt0 

Total 

T2.4 

No.  MPNST/No.  patients 

111 

6/6 

14/6 

27/19 

T2.5 

T2.6 

Chromosome 

lq33 

0 

50 

50 

37 

T2.7 

5pl5 

28 

50 

35 

37 

T2.8 

7pl4 

28 

83 

71 

63 

T2.9 

7p21 

28 

50 

64 

52 

T2.10 

7q31 

28 

33 

64 

48 

T2.ll 

8q22 

28 

33 

64 

48 

T2.12 

8q23-q24 

14 

33 

85 

55 

T2.13 

15q24-q25 

0 

17 

71 

41 

T2.14 

17q22-q24 

42 

83 

85 

74 

T2.15 

17q25 

71 

83 

78 

78 

a:  Refs.  [105,123]. 
b:  Ref.  [105]. 
c:  Ref.  [106]. 
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593 

gains  (Table  1)  were  amplified  frequently.  The  fre¬ 

number  of  chromosomal 

segments  amplified  were 

604 

594 

quency  of  7p  amplifications  may  be  overestimated 

unlikely  to  be  due  to  tumor  grade,  because  the  majority 

605 

595 

because  these  four  MPNST  (considered  as  different 

of  MPNST,  14  of  20  (70%)  of  NF1 -associated  and  22  of 

606 

596 

primary  tumors)  were  from  a  single  patient.[  106,1 243  In 

34  (64%)  of  sporadic,  were 

5  grade  3  (poorly  differential- 

607 

597 

contrast*  amplifications  of  5p,  8q,  and  12q  were  the  most 

ed).  Although  amplifications  were  more  frequent  in  NF 1  - 

608 

598 

prevalent  in  sporadic  MPNST.  There  was  a  significant 

associated  than  sporadic  MPNST  (50%  vs.  32%),  the 

609 

599 

difference  in  the  frequency  of  tumors  with  more  than 

number  of  NF1 -associated  tumors  may  be  overestimated 

610 

600 

one  chromosomal  segment  amplified.  Only  10%  of 

because  it  includes  multiple  tumors  from  the  same 

611 

601 

NF1 -associated  MPNST  had  more  than  one  amplified 

patient.  Adjustment  for 

one  tumor/patient  gives  a 

612 

602 

segment*  whereas  the  frequency  was  72%  in  sporadic 

frequency  of  25%  (5  of  20)  for  NF1 -associated  MPNST. 

613 

603 

tumors  (Table  3).  Differences  in  the  location  and 

Similarly,  there  was  no  correlation  between  chromosomal 

614 

T3.1 

Table  3,  Large-scale  chromosomal  amplifications  in  MPNST.3 

T3.2 

Chromosome  segment  amplified  No.  NF1 -associated  MPNSTb  (N  =  20) 

No,  sporadic  MPNST  (N  =  34) 

T3.3 

4q  1 2-q  1 3 

1 

T3.4 

5pll-pl5 

1 

T3.5 

5pl4 

1 

T3.6 

5pl5 

2 

T3.7 

5pl3-pter 

2 

T3.8 

7pl4-pter 

2d 

1 

T3.9 

7pl3-pter 

ld 

T3.10 

7pl2-pter 

ld 

T3.ll 

7pl  l-pl2 

1 

T3.12 

8ql2-qter 

2 

T3.13 

8ql3 

1 

T3.14 

8q21-q22 

1 

T3.15 

8q22-q23 

1 

T3.16 

8q24 

1 

T3.17 

8q23-qter 

1 

T3.18 

9p21-p23 

le 

T3.19 

9q31-q33 

1 

T3.20 

12pl2 

le 

T3.21 

12pl3 

I 

T3.22 

12ql3-ql4 

1 

T3.23 

12q!4-q21 

3 

T3.24 

12q24 

3 

T3.25 

13ql3-q33 

1 

T3.26 

13q32-q33 

1 

T3.27 

17pl  l-pl2 

1 

T3.28 

17q24-qter 

2 

T3.29 

17q22-q24 

1 

T3.30 

20ql2-qter 

1 

T3.31 

Xpli 

1 

T3.32 

Xp21-p22 

1 

T3.33 

Summary 

T3.34 

%  tumors  with  an  amplification 

50  (10/20) 

32  (11/34) 

T3.35 

%  tumors  with  >1  amplified  segment 

0.1  (1/10) 

72  (8/11) 

T3.36 

%  patients  with  >  1  tumor  with  an  amplification 

30  (4/12) 

32  (11/34) 

a:  Centromerie  regions*  chromosomes  19  and  Y,  and  Ip32-p36  were  not  scored  for  technical  reasons.  (From  Ref.  [105].) 
b:  Refs.  [105*106*124]. 
c:  Refs.  [105*124]. 

d:  Four  of  five  primary  MPNST  from  one  patient.  (From  Ref.  [106].) 
e:  Two  of  four  primary  MPNST  from  one  patient.  (From  Ref.  [106].) 
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615  amplification  and  progression  to  metastasis.  Analysis  of 

616  tissue  from  a  primary  tumor,  recurrence,  and  metastasis 

617  from  a  single  patient  showed  a  single  amplification 

618  (8ql2-qter)  in  the  recurrent  tumor.[124] 

619  Comparison  of  the  chromosomal  losses  detected  in 

620  the  three  CGH  studies  revealed  the  following  five  most 

621  frequent  losses  in  27  NF1 -associated  MPNST:  13q21- 

622  22  (12/27),  llq23-25  (10/27),  lp22-31  (9/27),  3ql  1-21 

623  (8/27),  and  17pl2-pter  (7/27).tI05- 1061231  It  is  of  interest 

624  that  the  loss  of  17pl2-pter  was  detected  in  50%  (7  of 

625  14)  tumors  in  one  study[106]  and  in  none  of  the  tumors 

626  of  the  other  studies.  The  relatively  low  detection  rate  of 

627  chromosomal  losses  may  be  due  to  the  decreased 

628  sensitivity  of  CGH  in  polyploid  MPNST. [125] 

629  The  CGH  analysis  of  multiple,  presumably  syn- 

630  chronous  or  metachronus,  primary  MPNST  at  different 

631  sites  in  three  NF1  patients  revealed  a  remarkable 

632  similarity  in  chromosomal  gains  and  losses. [106,1241  For 

633  example,  five  grade  3  (poorly  differentiated)  tumors  of 

634  one  patient  each  showed  imbalances  of  +7p,  —  13q21, 

635  and  +  17q22-qter.  These  data  showed  that  in  the  speci- 

636  fic  genetic  background  of  each  patient,  a  relatively 

637  limited,  and  defined,  number  of  rearrangements  were 

638  shared  among  the  tumors.  Similarly,  nearly  identical 

639  aberrations  were  found  in  different  MPNST  from  the 

640  same  patient.[1251  Although  limited,  these  data  suggest 

641  that  each  individual’s  constitutional  genotype  sets  a 

642  certain  “baseline”  on  which  a  minimal  and  limited 

643  number  of  genetic  alterations  are  necessary  and  suf- 

644  ficient  for  MPNST  development. 

645  Consistent  with  CGH  analysis,  the  karyotype  of 

646  NF1 -associated  MPNST-derived  cells  are  complex  with 

647  chromosomal  numbers  ranging  from  34  to  270  indica- 

648  tive  of  hypodiploidy ,  hypotriploidy ,  hypotetraploidy,  hy- 

649  pertriploidy,  and  hypertetraploidy  (Refs.  [109,116,125] 

650  and  references  therein).  Although  breakpoints  were  fre- 

651  quent,  a  common  specific  breakpoint  was  not  detected. 

652  A  comparison  of  CGH  and  karyotying  in  six  MPNST 

653  revealed  significant  overlap  in  the  most  frequent  gains 

654  and  detected  losses  at  19q  (3  of  6  tumors),  a  region  not 

655  analyzed  in  the  CGH  studies. [106, 1253  Plaat  et  al.[1I6] 

656  performed  a  computer-assisted  cytogenetic  analysis  of 

657  46  MPNST  reported  in  the  literature  and  7  new  cases  of 

658  both  NF1 -associated  and  sporadic  tumors  (Ref.  [116] 

659  and  references  therein).  These  studies  confirmed  the 

660  CGH  observation  of  high-frequency  gains  at  chromo- 

661  somes  7p  and  7q  and  losses  at  lp  and  17p.  However, 

662  their  reported  cytogenetic  differences  between  NF1- 

663  associated  and  sporadic  MPNST[1161  were  not  con- 

664  firmed  in  later  studies. [  125,1261  In  one  study,  near  triploid 

665  or  near  tetraploid  clones  were  associated  with  grade  3 

666  tumors  and  a  poor  prognosis.[126]  The  detection  of  a 

667  t(X;18)  translocation  in  MPNST[l27]  was  not  confirmed 
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in  either  sporadic1- 1281  or  NF1 -associated  neurofibroma  668 
or  MPNST.[129]  669 

Inactivating  mutations  in  several  tumor  suppressor  670 
genes  have  been  identified  in  NF1 -associated  MPNST.  671 
Both  LOH  and  intragenic  missense  mutations  of  672 
TP53[50, 62,1 12,1301  have  been  detected.  Like  many  sarco-  673 
mas,  NF1 -associated  MPNST  often  showed  overexpres-  674 
sion  of  p53  (the  protein  product  of  the  TP53  gene)  as  675 
assayed  by  immunoreactive  positivity  in  the  nucle-  676 

us  [50,ii2,i3i,i32]  jn  keeping  with  findings  in  other  tumors,  677 
this  most  likely  is  mutant  p53  protein,  which  accumulates  678 
due  to  its  increased  stability.  Mutant  protein  is  thought  to  679 
promote  cancer  by  either  complexing  with  and  seques-  680 
tering  functional  p53  or  by  complexing  with  p63  and  p73  681 

and  blocking  their  normal  transcription  factor  activities  682 
(reviewed  in  Ref.  [133]).  Immunohistochemical  detec-  683 

tion  of  p53  was  more  common  in  NF1 -associated  vs.  684 

sporadic  MPNST,  and  it  was  associated  with  poor  685 

prognosis  in  NF1  children. [132]  About  50%  of  NF1-  686 

associated  MPNST  showed  homozygous  deletion  for  687 

exon  2  of  the  INK4A  gene,  and  over  90%  were  688 

immunonegative  for  its  protein  product  pl6.[1 12-134.i35]  6g9 

Homozygous  deletion  of  exon  2  results  in  deficiency  for  690 

both  pi 6  and  pl4ARF,  two  proteins  encoded  by  691 

alternative  splicing  of  1NK4A  (also  known  as  CDKN2A).  692 

Both  of  these  proteins  are  tumor  suppressors  that  693 

modulate  activities  of  the  RB  and  p53  pathways,  which  694 

are  critical  for  cell  cycle  control  and  tumor  surveillance  695 

(reviewed  in  Ref.  [136]).  MXI1  mutations  in  regions  that  696 

encode  known  functional  domains  have  been  detected  in  697 

the  two  NF1  -associated  MPNST  analyzed.[137]  Mxil  is  698 

an  agonist  of  the  oncoprotein  Myc  and  is  thought  to  limit  699 

cell  proliferation  and  help  maintain  the  differentiated  700 

state  (reviewed  in  Ref.  [138]).  Mxil -deficient  mice  701 

develop  tumors,  and  Mxil  deficiency  decreases  the  702 

latency  of  tumors  that  arise  in  Ink4a-deficient  mice.[139]  703 

If  additional  studies  show  that  somatic  inactivation  of  704 

MXI1  is  common,  it  would  suggest  a  link  between  the  705 

pathways  of  Ink4a,  Myc,  and  Ras  in  NF1- associated  706 

MPNST.[140>,41]  707 

Several  differences  observed  in  NF1  -associated  708 

MPNST  are  likely  involved  in  the  malignant  transfor-  709 

mation  of  a  preexisting  neurofibroma.  TP53  or  1NK4A  710 

mutations/altered  expression  were  not  detected  in  711 

neurofibromas.150,5 1,54,62,67,69,1 12,132,134,1351  Further-  712 

more,  p53-positive  nuclei,  typically  associated  with  713 

MPNST,  were  observed  in  a  few  cells  at  the  transitional  714 

zone  between  an  existing  plexiform  neurofibroma  and  715 

an  arising  MPNST. f5 11  One  plexiform  neurofibroma  716 

proximal  to  an  MPNST  did  show  pl6  immunonegativ-  717 

ity.[1121  Microdissection  of  a  preexisting  neurofibroma  718 

from  its  MPNST  focal  malignant  process  revealed  5  719 

chromosomal  imbalances  in  the  neurofibroma,  all  of  720 
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721  which  were  novel  compared  to  the  10  imbalances  in  the 

722  MPNST  component.11061  These  data  are  consistent  with 

723  a  restructuring  of  the  genome  during  transformation. 

724  An  additional  distinguishing  feature  of  MPNST  is  the 

725  high  labeling  index  of  the  nuclear  proliferating  antigen 

726  Ki67,  which  was  correlated  with  reduced  survival  in  a 

727  study  that  combined  NF1 -associated  and  sporadic 

728  MPNST.151’13211425 


729  OTHER  NF1-ASSOCIATED 

730  NEOPLASMS  AND 

731  MOUSE  MODELS 

732  In  addition  to  peripheral  nerve  sheath  tumors, 

733  individuals  affected  with  NF1  are  at  increased  risk  for 

734  an  array  of  other  tumors.  Epidemiologlcally  associated 

735  neoplasms  include  medulloblastoma,  pheochromocyto- 

736  ma,  astrocytoma,  and  adenocarcinoma  of  the  ampulla 

737  of  Vater  (Ref.  [143]  and  references  therein).1  i44] 

738  Primarily  children  affected  with  NF1  are  at  increased 

739  risk  for  optic  pathway  gliomas  and  brainstem  gliomas, 

740  rhabdomyosarcomas,  and  malignant  myeloid  leuke- 

741  rniasJ41*145”1491  The  NF1  patients  are  also  at  increased 

742  risk  for  a  second  malignancy,  some  of  which  may  be 

743  treatment-related.*146’150’1511  In  different  studies,  8%- 

744  21%  of  NF1  patients  with  a  first  malignancy  developed 

745  a  second  cancer,  compared  to  a  frequency  of  4%  In  the 

746  general  population.114*51  Malignancy  in  NF1  has  been 

747  reviewed  recently.*1521 

748  Although  heterozygous  mutant  mice  (Nfl*l~) 

749  develop  tumors,  they  are  not  the  characteristic  peri- 

750  pheral  nerve  sheath  tumors  characteristic  of  the  human 

751  disease.*153-1551  The  Nfl -deficient  mice  (i¥fi“/“) 

752  die  in  utero  from  cardiac  defects.  Mice  chimeric  for 

753  f Nfl~f~ )  and  (Nfl^/+)  cells  were  able  to  develop  many 

754  microscopic  plexiform  neurofibromas,  but  dermal 

755  tumors  did  not  develop.  ** 561  In  addition,  mice  doubly 

756  heterozygous  for  mutations  in  Nfl  and  p53  developed 

757  MPNST  that  showed  LOH  at  both  tumor  suppressor 

758  loci.*156,1571  Recently,  mice  were  constructed  such  that 

759  only  their  Sclrwann  cells  were  Nfl  deficient.*1581 

760  Different  tumor  phenotypes  were  observed,  depending 

761  on  whether  the  Nfl -deficient  Schwann  cells  were  in  an 

762  animal  with  an  Nfl+/~  or  an  Nfl*/+  constitutional 

763  genotype.  In  the  Nfl+/~  genetic  background,  plexiform 

764  neurofibromas  composed  of  Nfl -deficient  Schwann 

765  cells,  and  the  fibroblasts  and  mast  cells  that  normally 

766  occur  in  human  neurofibromas,  developed  on  periphe- 

767  ral  nerves.  In  the  Nfl+/*  genetic  background,  Nfl- 

768  deficient  Schwann  cells  did  not  participate  in  neurofi- 

769  bromagenesls  but  did  form  relatively  small  hyperplastic 

770  lesions  of  the  cranial  nerves  containing  minimal  mast 


Stephens 

cells.  This  mouse  model  demonstrates  that  neurofibro- 
min  deficiency  in  Schwann  cells  is  sufficient  for 
generating  nascent  tumor  lesions,  but  frank  plexiform 
neurofibroma  development  requires  neurofibromin 
haploinsufficiency  in  cells  of  the  surrounding  tissues. 

With  the  development  of  mouse  models,  under¬ 
standing  the  genetics,  pathology,  and  natural  history  of 
human  benign  and  malignant  peripheral  nerve  sheath 
tumors  takes  on  new  Importance.  It  is  only  by  accurate 
modeling  of  the  human  disease  that  progress  can  be 
made  toward  therapies  that  can  slow  or  halt  neurofi- 
bromagenesis  and  reduce  the  high  risk  of  malignancy 
associated  with  NF1  disease. 
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Neurofibromatosis  1  and  neurofibromatosis  2  are  two  distinct  genetic  disorders  that  predispose  to  the 
development  of  tumors  primarily  of  the  nervous  system.  A  comparision  of  some  features  of  these 
disorders  is  given  in  Table  20-1. 

NEUROFIBROMATOSIS  (NF1) 

MOLECULAR  BASIS  OF  DISEASE 

NF1  is  an  autosomal  dominant  progressive  disorder  with  high  penetrance  but  extremely  variable 
expressivity  (reviewed  in  Ref.1,2  and  http://www.geneclinics.org/).  The  cardinal  features  are  cafe  au  lait 
macules,  intertrigenous  freckling,  Lisch  nodules,  and  multiple  neurofibromas.  The  National  Institutes  of 
Health  diagnostic  criteria  for  a  diagnosis  of  NF1  established  in  1987  are  still  appropriate  and  are  widely 
used  by  clinicians  and  research  investigators.  Neurofibromas  are  benign  nerve  sheath  tumors  that  arise  on 
peripheral  nerves.  Dermal  neurofibromas  develop  in  virtually  all  cases  ofNFl,  appear  in  late  childhood, 
grow  slowly,  increase  in  number  with  age,  and  are  at  low  risk  for  transformation  to  malignant  peripheral 
nerve  sheath  tumors  (MPNST;  previously  termed  neurofibrosarcoma).  However,  diffuse  plexiform 
neurofibromas  and  deep  nodular  plexiform  neurofibromas  can  give  rise  to  MPNST.  Individuals  affected 
with  NF1  have  a  10%  lifetime  risk  for  MPNST.  Other  neoplasms  epidemiologically-associated  with  NF1 
include  medulloblastoma,  pheochromocytoma,  astrocytoma,  adenocarcinoma  of  the  ampulla  of  Vater. 
Primarily  children  affected  with  NF1  are  at  increased  risk  for  optic  pathway  gliomas  and  brainstem 
gliomas,  rhabdomyosarcomas,  and  malignant  myeloid  leukemias.  NF1  patients  are  also  at  increased  risk 
for  a  second  malignancy,  some  of  which  may  be  treatment-related. 

NF1  is  caused  by  haploinsufficiency  for  the  tumor  suppressor  neurofibromin,  the  protein  product 
of  the  NF1  gene  (Table  20-1).  Most  constitutional  mutations  are  private,  predict  the  truncation  of 
neurofibromin,  and  occur  throughout  the  gene,  although  some  exons  are  more  mutation-rich  than  others 
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(see  below).  Less  frequent  are  missense  mutations  and  large  NF1  deletions  that  involve  contiguous 
genes.  About  one-half  of  cases  are  familial  (inherited  from  an  affected  parent)  and  one-half  are  sporadic 
resulting  from  a  de  novo  NF1  mutation.  Neurofibromin  normally  functions  as  a  negative  regulator  of  the 
ras  oncogene  by  stimulating  the  conversion  of  active  GTP-bound  Ras  to  the  inactive  GDP-bound  form  by 
hydrolyzing  GTP.  Biochemical,  cell  culture,  and  genetic  studies  in  both  NF1  patients  and  mouse  models 
are  consistent  with  a  model  whereby  a  somatic  mutation  inactivates  the  remaining  functional  NF1  gene  in 
a  progenitor  Schwann  cell  as  an  early,  probably  initiating,  event  in  the  development  of  neurofibromas 
(reviewed  in 3).  Biallelic  inactivation  of  NF1  also  occurs  in  progenitor  cells  of  NF1 -associated  tumors 
other  than  those  of  the  nerve  sheath.  Homozygous  inactivation  of  neurofibromin  leads  to  increased 
activated  ras,  resulting  in  aberrant  mitogenic  signaling,  and  consequent  loss  of  growth  control. 


CLINICAL  UTILITY  OF  TESTING 

A  diagnosis  to  NF1  can  nearly  always  be  made  based  on  clinical  findings,  particularly  after  8 
years  of  age.  Clinical  DNA-based  testing  is  available  (http://www.geneclinics.org/).  Testing  is  not 
typically  used  for  diagnostic  purposes,  but  can  be  useful  for  confirming  a  clinical  diagnosis,  reproductive 
counseling,  and  prenatal  diagnosis.  Blanket  recommendations  for  diagnostic  testing  for  NF1  cannot  be 
made  because  the  sensitivity  of  making  a  clinical  diagnosis  is  very  high  and  the  sensitivity  of  molecular 
testing  is  not  100%  (see  below).  Furthermore,  benefits  of  diagnostic  testing  are  subjective  and  may  differ 
from  family  to  family.  Prenatal  diagnosis  of  amniocytes  or  chorionic  villus  tissue  is  available  only  in 
cases  where  the  pathogenic  germline  mutation  has  been  identified  previously  in  an  affected  parent. 

The  primary  genetic  counseling  issue  related  to  molecular  testing  of  NF1  is  the  inability  to  predict 
the  severity  or  course  of  the  disorder  in  a  patient  or  fetus.  Even  among  family  members  who  carry  the 
same  NF1  mutation,  there  can  be  considerable  variation  in  clinical  manifestations  and  complications.  For 
the  majority  of  cases,  there  is  no  correlation  between  genotype  and  phenotype.  For  the  —5-10%  of  cases 
that  are  hemizygous  for  NF1  due  to  a  submicroscopic  deletion  (most  commonly  1.5  Mb)  there  is  a 
predisposition  to  an  early  age  of  onset  and  excessive  numbers  of  dermal  neurofibromas  (Ref.4  and 
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references  therein).  NF1  testing  may  be  useful  to  confirm  a  diagnosis  in  a  patient  with  equivocal 
findings,  such  as  child  who  has  a  few  cafe  au  lait  macules  and  carries  a  presumptive  diagnosis  of  NF1.  In 
this  instance  it  is  important  to  realize  that  the  sensitivity  and  specificity  of  testing  patients  that  do  not 
fulfill  the  NIH  diagnostic  criteria  for  a  diagnosis  of  NF1  is  unknown.  For  unaffected  parents  of  a  child 
with  sporadic  NF1,  recurrence  risk  is  a  concern.  Because  germline  mosaicism  has  been  reported  in  an 
asymptomatic  parent  of  a  child  with  sporadic  NF15,  there  is  a  small  but  unknown  increased  risk  of 
recurrence  even  if  the  child’s  pathogenic  mutation  is  not  detected  in  the  genomic  DNA  of  parental 
lymphoblasts.  Sporadic  NF1  cases  with  post-zygotic  mutations  resulting  in  somatic  mosaicism  have  been 
reported,  but  the  frequency  is  unknown.  Mosaic  individuals,  who  carry  the  NF1  mutation  in  only  a 
fraction  of  their  cells,  may  have  mutation-negative  test  results  due  low  signal  to  noise  ratio,  i.e.,  low  level 
of  mutant  allele  in  a  background  of  two  normal  NF1  alleles.  Genetic  counseling  regarding  the  clinical 
and  reproductive  implications  of  NF1  mosaicism  is  recommended.6 

AVAILABLE  ASSAYS 

NF1  gene  mutation  is  the  only  known  cause  of  the  disorder.  Efficient  detection  of  NF1  gene 
mutations  requires  a  multipronged  approach  due  to  the  large  number  of  exons  and  size  of  the  gene  (Table 
20-1),  variation  in  type  and  distribution  of  mutations,  and  large  fraction  of  private  mutations.  About  70- 
80%  of  mutations  result  in  a  premature  translation  termination  codon  with  nonsense  and  splicing  defects 
being  the  most  common.7  These  can  be  detected  by  the  protein  truncation  test  (PTT)  (see  Chapter  2), 
which  identifies  truncated  neurofibromin  polypeptides  synthesized  by  in  vitro  translation  of  multiple 
overlapping  segments  of  the  NF1  cDNA  isolated  from  primary  lymphoblasts  or  Epstein-Barr  virus 
(EBV)-transformed  lymphoblasts.  A  detection  rate  of  about  80%  can  be  attained  with  an  optimized  PTT 
protocol  (see  below).  The  vast  majority  of  these  mutations  are  private  to  each  individual/family,  although 
there  are  recurrent  mutations  that  may  account  for,  at  most,  a  few  percent  of  cases  (Figure  20-1).  About 
10%  of  NF1  mutations  are  missense  or  in-frame  insertions/deletions  of  a  few  nucleotides,7,8  some  of 
which  show  clustering  (Figure  20-1).  Direct  sequencing  of  cDNA  or  scanning  of  NF1  exons  and  splice 
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junctions  in  genomic  DNA  by  denaturing  high  performance  liquid  chromatography  (DHPLC), 
temperature  gradient  gel  electrophoresis  (TGGE),  single  strand  conformation  polymorphism  (SSCP), 
and/or  heteroduplex  analysis  (HA)  (see  Chapter  2)  have  been  used  to  detect  missense  and  other  subtle 
mutations  in  primary  lymphoblasts.  DHPLC  detected  both  truncating,  missense,  and  small  in-frame 
deletions/insertions  with  a  detection  rate  of  97%  in  a  retrospective  study9  and  of  72%  in  a  prospective 
study.10  For  mutation  scanning  testing  protocols,  the  detection  rate  will  be  laboratory-specific  due  to  the 
degree  of  optimization  of  the  specific  technique;  a  survey  of  testing  laboratories  is  recommended. 

DHLPC  has  the  advantages  of  using  genomic  DNA  and  high  throughput  capability  compared  to  the 
cDNA/gel-based  PTT.  However,  a  recently  reported  high-throughput  PTT  may  be  available  for  clinical 
testing  in  the  future.11  Fluorescent  in  situ  hybridization  (FISH;  see  Chapter  2)  with  NF1  probes  is  used  to 
detect  the  estimated  5-10%  of  cases  due  to  submicroscopic  NF1  microdeletion12  (Figure  20-1),  although  a 
recently  reported  NF1  deletion  junction-specific  PCR  assay  may  be  clinically  applicable  in  the  near 
future.13  Routine  cytogenetic  analysis  is  used  to  detect  rare  cases  due  to  translocation  or  chromosomal 
rearrangement.  Linkage  analysis  is  an  indirect  test  that  tracks  the  inheritance  of  a  disease  allele  in 
members  of  a  family.  Linkage  testing  is  clinically  available  for  at-risk  individuals  with  multiple  family 
members  carrying  a  definitive  diagnosis  ofNFl.  The  availability  of  NF1  intragenic  markers  increases  the 
informativeness  of  this  methodology.  This  may  be  the  quickest,  most  economical  NF1  test  for  individuals 
of  families  that  fulfill  the  testing  criteria. 

INTERPET ATION  OF  TEST  RESULTS 

The  detection  of  a  truncated  neurofibromin  polypeptide  by  PTT  is  the  most  sensitive  method 
(~80%),  but  can  result  in  false  positives.7  High  specificity  requires  identifying  the  underlying  mutation  at 
the  genomic  DNA  and  cDNA  levels  since  false  positives  can  arise  during  handling  of  the  peripheral  blood 
sample  (see  below).  The  interpretation  of  missense  and  subtle  in-frame  alterations  as  pathogenic 
mutations  versus  neutral  polymorphisms  is  complicated  by  the  lack  of  a  functional  assay  for 
neurofibromin.  Apparent  recurrence  of  a  putative  mutation  based  on  prior  literature  reports  must  be 
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interpreted  with  caution,  as  the  majority  of  NF1  mutational  analyses  did  not  sequence  the  entire  gene.  No 
comprehensive  NF1  mutation  database  is  available,  however  some  mutations  have  been  submitted  to  the 
Human  Gene  Mutation  Database  (http://archive.uwcm.ac.uk/uwcm/mg/hgmdO.html)  and  the  National 
Neurofibromatosis  Foundation  International  NF1  Genetic  Mutation  Analysis  Consortium  Web  Site 
(http://www.nf.org/nflgene/nflgene.home.html).  Although  most  likely  rare,  families  with  different 
independent  NF1  inactivating  mutations  in  affected  individuals  have  been  reported,14  presumably  a 
reflection  of  the  high  mutation  rate  of  the  gene  (~  1 05/gamete/generation).  The  interpretation  of  FISH 
with  NF1  probes  can  be  complicated  by  mosaicism  for  an  NF1  microdeletion.15  Furthermore,  until  the 
recent  report  of  an  apparent  tandem  duplication  of  the  NF1  gene16  is  confirmed  or  disproven,  the 
interpretation  of  negative  FISH-based  test  results  should  be  made  with  caution.  The  frequency  of 
mosaicism  for  an  NF1  mutation  is  not  known,  however  this  is  likely  the  underlying  mechanism  for 
patients  with  segmental  or  localized  signs  of  the  disorder.2 

LABORATORY  ISSUES 

Optimal  detection  of  mutations  that  predict  a  truncated  neurofibromin  polypeptide  occurs  when 
the  nonsense-mediated  decay  pathway  is  at  least  partially  inhibited,  thereby  increasing  the  ratio  of  mutant 
transcripts  with  a  premature  termination  codon  to  normal  transcripts.  A  protein  synthesis  inhibitor,  such 
as  puromycin,  in  the  culture  medium  is  effective  for  EBV-transformed  lymphoblasts  or 
phytohemaglutinin-stimulated  primary  lymphocytes.7, 17  Furthermore,  blood  handling  and  shipping 
protocols  must  be  used  to  reduce  false  positives  in  PTT  resulting  from  environmental  effects  such  as  cold 
shock18  or  delay  in  mRNA  isolation.7, 17  There  is  no  approved  program  of  interlaboratory  comparison  for 
NF1  testing;  performance  assessment  must  be  conducted  by  participation  in  ungraded  proficiency  survey 
programs,  split  sample  analysis  with  other  laboratories,  or  other  suitable  and  documented  means.  There 
are  no  commercially  available  NF1  testing  kits,  probes,  or  controls.  Intronic  primers  for  amplification  of 
individual  NF1  exons  that  apparently  do  not  co-amplify  the  NF1  pseudogene  fragments  located  at 
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multiple  chromosomal  sites  have  been  published.19  Some  issues  related  to  NF1  testing  have  been 
reviewed  recently.20 


NEUROFIBROMATOSIS  2  (NF2) 

MOLECULAR  BASIS  OF  DISEASE 

The  development  of  bilateral  vestibular  schwanomas  is  a  hallmark  of  NF2.  Other  commonly- 
associated  tumors  include  schwannomas  of  other  central,  spinal,  and  peripheral  nerves  and  meningiomas 
(reviewed  in  Ref 2,21,22  and  http://www.geneclinics.org/).  Although  these  tumors  are  not  malignant  but 
their  location,  along  with  the  propensity  to  develop  multiple  tumors,  make  this  a  life-threatening  disorder. 
Most  patients  become  completely  deaf  and  can  have  poor  balance,  vision  and  weakness.  The  mean  age  of 
onset  is  18-24  years  and  the  mean  age  of  death  is  36  years.  Ependymomas  and  astrocytomas  occur  less 
frequently  and  are  usually  indolent  CNS  tumors.  Patients  affected  with  NF2  are  at  minimal  increased  risk 
for  malignancy.  Juvenile  posterior  subcapsular  cataract  is  a  common  nontumor  manifestation.  The 
disorder  may  be  under-diagnosed  in  children  who  present  with  ocular  and  skin  manifestations.  Early 
diagnosis  improves  management,  which  is  primarily  surgical  and  radiological.  Modifications  to  the 
criteria  for  a  diagnosis  of  NF2,  initially  established  by  the  1987/1991  National  Institute  of  Health 
Consensus  Conference,  have  been  proposed  to  increase  the  specificity.23 

NF2  is  caused  by  haploinsufficiency  for  the  tumor  suppressor  merlin  (or  schwannomin),  the 
protein  product  of  the  NF2  gene  (Table  20-1).  About  one-half  of  patients  are  the  first  case  of  NF2  in  the 
family.  These  sporadic  cases  result  from  de  novo  mutation  of  the  NF2  gene,  a  significant  fraction  of 
which  are  postzyogotic  mutations  that  result  in  mosaicism  (see  below).  The  majority  of  constitutional 
mutations  are  private,  predict  the  truncation  of  merlin,  and  occur  throughout  the  gene  (see  below). 
Vestibular  schwannomas  develop  from  a  progenitor  Schwann  cell  that  does  not  express  merlin  due  to  a 
somatic  mutation  that  inactivated  the  single  remaining  NF2  gene.  Merlin  is  a  protein  of  the  cytoskeleton 
(reviewed  in  Ref.3,22).  Merline  associates  with  transmembrane  proteins  important  in  adhesion,  proteins 
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involved  in  signaling  pathways,  and  cytoskeletal  proteins.  Merlin  deficiency  alters  cell  adhesion, 
motility,  and  spreading.  The  normal  function  of  merlin  remains  to  be  determined  but  likely  involves 
control  of  adhesion,  proliferation,  and  the  Rho  signaling  pathway. 

CLINICAL  UTILITY  OF  TESTING 

DNA-based  clinical  testing  for  NF2  is  available  (http://www.geneclinics.org/)  and  used  primarily 
for  presymptomatic  testing  of  at-risk  individuals,  typically  young  children  of  an  affected  parent.  An  early 
diagnosis  of  NF2  may  improve  outcome  and  at-risk  children  who  did  not  inherit  the  NF2  mutation  can  be 
spared  costly  brain  imaging  and  audiologic  screening.  Genetic  counseling  is  recommended  prior  to 
testing  presymptomatic  at-risk  children.  Testing  is  also  useful  to  confirm  a  clinical  diagnosis,  which  may 
be  most  helpful  in  sporadic  cases  of  NF2,  particularly  children  who  present  with  ocular  or  skin 
manifestations  or  adults  with  equivocal  findings  or  mild  disease.  Some  of  these  cases  may  be  mosaic  for 
an  NF2  mutation,  as  the  frequency  of  mosaicism  is  estimated  at  16.7  -  24.8%  of  sporadic  cases.24 
Genetic  counseling  regarding  the  clinical  and  reproductive  implications  of  NF2  mosaicism  is 
recommended.6  Testing  is  also  useful  for  reproductive  counseling  and  prenatal  diagnosis.  Prenatal 
diagnosis  of  NF2  using  amniocytes  or  chorionic  villus  tissue  is  available  only  in  cases  where  the 
pathogenic  germline  mutation  has  been  identified  previously  in  an  affected  parent.  Preimplantation 
genetic  diagnosis  of  NF2  has  been  reported.25 

The  primary  genetic  counseling  issues  regard  predicting  the  course  of  the  disorder  and  recurrence 
risks.  There  are  genotype/phenotype  correlations,  but  they  cannot  predict  the  age  of  onset  or  the  course  of 
disease  for  an  individual  patient.  Typically,  constitutional  frameshift  and  nonsense  mutations  are 
associated  with  severe  NF2,  defined  by  earlier  age  at  onset  and  higher  frequency  and  mean  number  of 
tumors.26, 27  Constitutional  missense  and  small  in-frame  mutations  are  thought  to  be  associated  with  mild 
disease27  and  mutations  in  splice  donor  and  acceptor  sites  result  in  variable  clinical  outcomes.28 
Recurrence  risks  for  asymptomatic  parents  of  an  affected  child  are  unknown,  but  are  somewhat  greater 
than  the  population  risk  due  to  the  possibility  of  germline  mosaicism  in  a  parent.29  For  mosaic  patients, 
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the  risk  of  transmitting  NF2  to  offspring  is  <50%,  depending  upon  the  proportion  of  gametes  that  carry 
the  NF2  mutation.6  Offspring  that  do  inherit  the  mutation  however,  will  have  a  constitutional  NF2 
mutation  and  may  have  more  severe  disease  than  their  mosaic  parent. 


AVAILABLE  ASSAYS 

NF2  gene  mutation  is  the  only  known  cause  of  the  disorder.  About  66%  of  mutations  are 
nonsense  or  ffameshifts  that  predict  the  premature  truncation  of  merlin.30,31  About  10%  of  mutations  are 
missense  (although  most  have  yet  to  be  tested  in  functional  assays)  and  the  remaining  are  at  splice 
donor/acceptor  sites  or  due  to  submicroscopic  rearrangements.  A  multipronged  approach  to  testing  is 
optimal  due  to  the  high  frequency  of  private  constitutional  mutations,  the  high  frequency  of  post-zygotic 
mutations,  and  the  distribution  of  mutations  throughout  the  gene.  Direct  sequencing  of  NF2  in  genomic 
DNA  is  clinically  available  and  has  a  detection  rate  of  about  60%  in  familial  NF2  cases.  The  exon 
scanning  techniques  of  SSCP,  TGGE,  and  HA  (see  Chapter  2),  followed  by  direct  sequencing  to  identify 
the  underlying  NF2  mutation,  detected  54-81%  of  mutations  in  familial  NF2  cases.26,27,32,33  The 
detection  rate  of  either  sequencing  or  exon  scanning  methods  is  significantly  lower  (34-51%)  in  sporadic 
cases  due  in  part  to  the  high  frequency  of  post-zygotic  NF2  mutations,  which  can  be  masked  by  the 
presence  of  normal  alleles.27'29,32  For  mutation  scanning  tests,  the  detection  rate  will  be  laboratory- 
specific  due  to  the  varying  degrees  of  optimization  of  the  technique;  a  survey  of  testing  laboratories  is 
recommended. 

Because  schwannomas  are  clonal  tumors  with  minimal  cellular  admixture,  NF2  mutations  can  be 

1 

detected  at  high  frequency  in  tumor  tissue.  Testing  of  tumor  tissue  is  available  clinically 
(http://www.geneclinics.org/)  and  is  most  useful  in  cases  where  a  mutation  is  not  detected  in  primary 
lymphoblasts,  clinical  manifestations  are  suggestive  of  somatic  mosaicism,  or  constitutional  tissue  is 
unavailable.  Different  exon  scanning  methods  have  detected  from  60-75%  of  mutations,  primarily  in 
vestibular  schwannomas.24, 34  Mutations  are  likely  to  be  germline  (rather  than  somatic)  if  the  identical 
mutation  is  detected  in  two  or  more  pathologically  or  anatomically  distinct  tumors  or  if  a  tumor  shows 
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loss  of  heterozygosity  for  NF2  intragenic/flanking  loci,  while  constitutional  tissue  is  heterozygous  at 
these  loci.  Mutational  analysis  of  tumors  is  expected  to  have  the  greatest  sensitivity  for  NF2  somatic 
mosaic  mutations24  and  sporadic  cases  with  negative  results  from  mutation  scanning  tests.35  Routine 
cytogenetic  analysis  and/or  FISH  with  NF2  probes  will  identify  chromosomal  rearrangements.  One  study 
detected  submicroscopic  rearrangements  in  70%  of  patients  in  whom  mutation  scanning  test  results  were 
negative.36  Rearrangements  (primarily  deletions)  were  intragenic  or  involved  all,  or  nearly  all,  of  the  NF2 
gene.  It  is  not  known  if  submicroscopic  whole  gene  deletions  extend  into  flanking  contiguous  genes  that 
may  contribute  to  the  phenotype  as  in  NF1  microdeletions  (see  above).  Linkage  analysis  is  clinically 
available  for  at-risk  individuals  and  fetuses  with  multiple  family  members  carrying  a  definitive  diagnosis 
of  NF2  who  are  willing  to  participate  in  the  testing  process.  The  availability  of  highly  informative  NF2 
intragenic  markers  increases  the  specificity  of  this  methodology.  For  certain  families,  linkage  analysis 
will  be  the  most  cost  effective,  fastest,  and  definitive  test.  It  can  sometimes  be  an  option  when  mutation 
scanning  test  results  are  negative. 

INTERPRETATION  OF  TEST  RESULTS 

Interpretation  of  the  results  of  exon  scanning  tests  requires  identifying  the  underlying  NF2 
mutation  at  the  genomic  DNA  and/or  cDNA  levels  to  avoid  false  positives.  Functional  assays  for  merlin 
have  been  developed  that  can  provide  insight  into  the  interpretation  of  missense  and  subtle  in-ffame 
alterations  as  pathogenic  mutations  versus  neutral  polymorphisms.37,38  No  comprehensive  NF2 mutation 
database  is  available,  however  some  mutations  have  been  submitted  to  the  Human  Gene  Mutation 
Database  (http://archive.uwcm.ac.uk/uwcm/mg/hgmdO.html)  and  to  an  NF2  Mutation  Information  site 
(http://neuro-trialsl .mgh.harvard.edu/nf2/).  Somatic  mosaicism  or  NF2  gene  deletions  must  be 
considered  in  patients  that  have  a  negative  mutation  scanning  test  result,  regardless  of  the  severity  of  their  - 
manifestations.  It  is  optimal  for  NF2  linkage  testing  not  to  include  the  first  affected  member  in  a  family, 
since  this  individual  may  be  mosaic  for  an  NF2  mutation,  which  can  lead  to  misinterpretation  of  test 
results.39 
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LABORATORY  ISSUES 

There  is  no  approved  program  of  interlaboratory  comparison  for  NF2  testing;  performance 
assessment  must  be  conducted  by  participation  in  ungraded  proficiency  survey  programs,  split  sample 
analysis  with  other  laboratories,  or  other  suitable  and  documented  means.  There  are  no  commercially 
available  NF2  testing  kits,  probes,  or  controls.  Direct  gene  sequencing  is  not  the  optimal  test  to  detect 
mosaicism  for  an  NF2  mutation  in  lymphoblasts,  as  reliable  detection  of  a  low  level  point  mutation  will 
be  difficult.  Exon  scanning  techniques  that  are  semi-quantitative,  such  as  TGGE,  will  detect  relative 
intensity  differences  between  heteroduplexes  and  homoduplexes  that  suggest  possible  mosaicism.32 
Depending  upon  age,  fixation,  and  storage  conditions,  some  tumors  may  not  yield  nucleic  acid  of 
sufficient  quality  for  mutational  analysis. 
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Table  20-1 .  Comparison  of  NF1  and  NF2  disorders. 


Feature 

Neurofibromatosis  1  (NF1) 

Neurofibromatosis  2  (NF2) 

Alternate  name 

peripheral  neurofibromatosis;  von 

central  neurofibromatosis;  bilateral 

Recklinghausen  neurofibromatosis 

acoustic  neuroma 

OMIM  accession  number1 

162200 

101000 

Mode  of  inheritance 

autosomal  dominant 

autosomal  dominant 

Frequency  of  disorder 

1/3000-1/4000  worldwide 

1/33,000-1/40,000  worldwide 

Gene  symbol 

NF1 

NF2 

Chromosomal  location 

1 7ql  1 .2 

22ql2.2 

Gene  size;  transcript  size 

-350  kb;  -11-13  kb2 

-110  kb;  2  kb2 

Genbank  accession  no. 

NTO 10799;  NM_000267 

Y 18000;  NM_000268 

(gene;cDNA)3 

Number  of  exons 

60 

17 

Tissue  expression  pattern 

Widely  expressed 

Widely  expressed 

Protein  product 

Neurofibromin  (>220  kD;  2818) 

merlin,  also  known  as  schwannomin 

(size  (kD);  #  residues) 

(65  kD;  595) 

Normal  functions  of  protein 

tumor  suppressor;  negative 

tumor  suppressor;  associates  with 

regulator  of  ras  oncogene 

proteins  of  the  cytoskeleton 

Common  associated  tumors 

Neurofibroma,  MPNST,  optic 

Bilateral  vestibular  schwannomas. 

pathway  and  brainstem  gliomas 

schwannomas  of  other  central  and 

peripheral  nerves,  meningiomas 

Animal  models 

— i - - 

mouse,  Drosophila 

mouse,  Drosophila 

- - 1 - - 1 - 

Online  Mendelian  Inheritance  in  Man  (http://www.ncbi  .nlm.nih.gov:80/entrez/query.fcgi?db=OMIM). 

Alternative  splicing  produces  transcripts  of  varying  lengths. 
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3See  Gene  Lynx  Human  (http://www.genelynx.org/)  for  a  compilation  of,  and  hyperlinks  to,  gene,  protein 
structure,  and  genomic  resources. 
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FIGURE  LEGENDS 

Figure  20-1.  Genomic  structure  and  mutations  in  NF1  and  NF2  genes.  A).  At  the  top  is  a  schematic  of 
the  NF1  gene  region  at  chromosome  segment  17qll.2.  The  350  kb  NF1  gene  is  flanked  by  51  kb 
paralogs  NF1REP-P1  (previously  termed  NF1REP-P)  and  NF1REP-M,  which  are  in  direct  orientation 
(orange  boxes).  Homologous  recombination  between  these  NF1REP  elements  results  in  the  recurrent  1.5 
Mb  NF1  microdeletion.4  NF1REP-P2  is  a  partial  element  with  a  limited  role  in  mediating  NF1 
microdeletions.  The  60  exons  of  the  NF1  are  represented  by  boxes  (not  to  scale)  and  exon  numbering  is 
sequential  except  as  indicated.  The  GRD  (exons  21 -27a)  and  a  cy stein/serine-rich  domain  with  3  cysteine 
pairs  suggestive  of  ATP  binding  (exons  11-17)  are  indicated.  Grey  boxes  indicated  alternatively  spliced 
exons  that  vary  in  abundance  in  different  tissues.  Mutations  have  been  identified  in  virtually  every  exon. 
Exons  where  mutations  are  apparently  in  greater  abundance  than  expected  are  indicated  (green  boxes).7, 8’ 
10  In  one  study,  exons  7,  10a, b,c,  23-2,  27a,  29,  37  and  accounted  for  30%  of  mutations,  15  of  which  were 
in  exons  10a,b,c,  which  harbor  3  recurrent  mutations,  including  Y489C,  which  alone  may  account  for 
~2%  of  mutations.  Blue  boxes  indicate  exons  that  had  clusters  of  missense  and/or  single  codon  deletion 
mutations.8  Some  of  the  recurrent,  although  still  infrequent,  mutations  are  given  below  the  exons.  B)  The 
17  exons  of  the  NF2  gene  are  represented  by  boxes  (not  to  scale)  and  exon  numbering  is  sequential.  The 
Protein  4.1-homology  domain  thought  to  mediate  binding  to  cell  surface  glycoproteins  (exons  2-8),  the  a- 
helical  domain  (exons  10-15),  and  the  unique  C-terminus  (exons  16-17)  are  indicated.  Grey  boxes 
indicated  alternatively  spliced  exons;  the  inclusion  of  exon  16  creates  a  alternate  termination  codon 
resulting  in  a  slightly  truncated  protein.  Mutations  have  been  identified  involving  each  exon  except  for 
16.  Exons  containing  recurrent  mutations  indicated  (green  boxes).  In  several  studies,  R57X  occurred  in 
8%  of  familial  constitutional  mutations.  Other  recurrent  nonsense  codons  are  shown. 


Recurrent  1 .5  Mb  NF1  microdeletion  (~  5-1 0%  of  mutations) 
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ABSTRACT 
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Neurofibromatosis  1  (NF1)  is  an  autosomal  dominant  disorder  characterized  by  cellular 
overgrowth  of  neural  crest  derived  tissues,  a  key  feature  being  benign  Schwann  cell  tumors 
called  neurofibromas.  NF1  is  caused  by  mutations  in  the  NF1  gene.  There  is  variable 
expressivity,  and  it  is  thought  that  other  genes  likely  modify  the  NF1  phenotype.  The  proto¬ 
oncogene  ERBB2  (a.k.a.  HER2/neu )  encodes  a  tyrosine  kinase  receptor,  which  is  a  member  of 
the  epidermal  growth  factor  receptor  family.  ErbB  proteins  have  been  implicated  in  animal 
neurofibroma  development,  via  a  specific  activating  ERBB2  mutation.  However,  no  human 
tumors  have  been  reported  with  this  mutation,  including  29  NF1  tumors  and  19  non-NFl 
Schwann  cell  tumors  examined  in  this  study.  An  Ile/Val  polymorphism  exists  at  codon  655  in 
the  transmembrane  domain  near  the  mutation  and  is  associated  with  an  increased  risk  for  breast 
cancer.  The  role  that  the  ERBB2  gene  product  plays  in  cell  proliferation,  as  well  as  the  critical 
function  of  the  transmembrane  domain  in  receptor  activation,  make  this  polymorphism  a 
candidate  for  a  modifier  gene  for  NF1.  We  examined  the  genetic  association  between  the 
ERBB2  codon  655  alleles  and  NF1.  Among  277  NF1  patients  and  290  controls,  we  found  that 
the  risk  of  having  NF1  with  the  Val/Val  genotype  was  lower  than  with  the  Ile/Ile  genotype. 
Furthermore,  the  number  of  NF1  patients  with  a  homozygous  Val/Val  genotype  is  significantly 
lower  than  that  found  in  our  control  populations.  These  data,  the  first  report  of  a  genetic 
association  analysis  of  NF1,  suggest  that  there  may  be  a  biological  relationship  between  ERBB2 
genotype  and  NF1  phenotype. 

Key  words:  Neurofibromatosis  1  (NF1),  ERBB2,  polymorphism/association,  modifier  gene 
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INTRODUCTION 


Neurofibromatosis  1  (NF1)  affects  approximately  1  in  3500  individuals,  and  is 
characterized  by  benign  overgrowth  of  neural  crest  derived  tissues,  including  the  formation  of 
cafe-au-lait  spots,  Lisch  nodules  and  neurofibromas.  Neurofibromas  are  slow-growing  tumors  of 
the  peripheral  nerve  sheath,  composed  mainly  of  Schwann  cells.  Some  patients  may  have  only  a 
few  small  dermal  neurofibromas,  while  other  can  develop  thousands  and/or  have  large 
neurofibromas  from  on  major  nerve  branches.  Clinical  symptoms  are  highly  variable  between 
patients,  even  within  families,  but  this  is  not  obviously  correlated  to  NF1  mutation  type. 
Consequently,  it  is  thought  that  along  with  stochastic  factors,  other  genes  may  play  a  role  as 
modifiers  in  NF1  phenotype  (Easton  et  al.,  1993). 

The  proto-oncogene  ERBB2  (or  HER2/neu )  encodes  a  transmembrane  tyrosine  kinase 
receptor  and  is  a  member  of  the  epidermal  growth  factor  receptor  family  thought  to  control  a 
number  of  cellular  functions,  such  as  cell  proliferation  (reviewed  in  Dougall  et  al.,  1994).  The 
receptor  heterodimerizes  with  other  members  of  the  ErbB  family,  activating  the  tyrosine  kinase 
domain  and  transmitting  growth  factor  signals  downstream  to  cytoplasmic  targets,  including  the 
MAPK  pathway  (which  is  also  downstream  in  the  NF1  pathway).  These  growth  factors  include 
ligands  known  to  stimulate  Schwann  cell  proliferation.  The  ERBB2  gene  is  frequently  amplified 
or  over-expressed  in  breast  cancer  and  a  number  of  adenocarcinomas,  often  signifying  a  poor 
prognosis  (reviewed  in  Dougall  et  al.,  1994).  Transplacental  administration  of  iV-nitroso-iV- 
ethylurea  (ENU)  to  rats  and  hamsters  induces  multiple  tumors  in  offspring  at  a  high  incidence 
(Bargmann  and  Weinberg,  1988;  Nakamura  et  al.,  1994).  The  rat  Schwann  cell  tumors  and 
hamster  neurofibromas  contain  an  activating  point  mutation  in  the  ERBB2  transmembrane 
domain  (e.g.  rat  codon  664;  analogous  to  codon  659  in  humans  and  hamsters),  leading  to  a  valine 
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to  glutamic  acid  amino  acid  substitution.  This  mutation  is  believed  to  cause  an  increase  in  the 
tyrosine  kinase  activity,  and  the  mutated  gene  has  been  shown  to  transform  NIH  3T3  cells 
(Bargmann  et  al.,  1986). 

To  our  knowledge,  no  human  tumors  have  been  found  with  this  mutational  change, 
although  a  coding  region  polymorphism  has  been  identified  in  the  transmembrane  region 
(Papewalis  et  al.,  1991).  The  polymorphism  is  located  at  codon  655  in  humans  and  encodes 
either  an  isoleucine  (ATC)  or  valine  (GTC).  It  is  unknown  whether  the  polymorphic  proteins 
differ  in  their  function  or  tyrosine  kinase  activity.  Interestingly,  results  from  a  population-based, 
case-control  study  examining  the  association  of  the  ERBB2  polymorphism  with  the  risk  of  breast 
cancer  showed  a  statistically  significant  increased  risk  for  early  onset  breast  cancer  in  women 
homozygous  for  the  valine  allele  (Xie  et  al.,  2000).  To  search  for  a  connection  between  ERBB2 
and  NFl/Schwann  cell  tumors,  we  first  screened  for  the  codon  659  mutation  in  a  set  of  tumors, 
and  then  searched  for  a  genetic  association  between  the  ERBB2  codon  655  alleles  and  NF1. 

y 

MATERIALS  AND  METHODS 
Patient  Populations. 

Case  samples  consisted  of  unrelated  NF1  patient  leukocyte  DNA  previously  collected 
under  IRB  approval  in  our  laboratory  (n=  180)  and  an  anonymous  set  generously  provided  by  Dr. 
Karen  Stephens  (n=97)  (University  of  Washington,  Dept,  of  Pathology).  The  NF1  diagnosis  was 
made  via  clinical  observations  satisfying  the  NIH  diagnostic  criteria.  These  patients  were 
ascertained  through  genetics  or  other  clinics,  or  via  advertising  through  the  National 
Neurofibromatosis  Foundation.  Two  control  groups  were  obtained,  with  the  first  group  (Cl) 
consisting  of  unrelated  non-NFl  individuals  whose  leukocyte  DNA  was  collected  previously  in 
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our  laboratory  (n=152).  The  second  group  (C2)  consisted  of  a  panel  of  anonymous  control  (non- 
NF1)  DNAs  generously  given  to  us  by  Dr.  Lawrence  Brody  (n=138)  (National  Human  Genome 
Research  Institute,  NIH)  (Struewing  et  al.,  1995).  DNA  extractions  were  performed  using  the 
PUREGENE  DNA  Isolation  Kit  (Gentra  Systems).  The  NF1  and  control  samples  in  our 
laboratory  consist  mostly  of  persons  of  a  Caucasian  background  from  the  state  of  Florida. 

The  ERBB2  codon  659  mutation  analysis  studied  DNA  extracted  from  primary  tumor 
samples  previously  collected  in  our  laboratory.  The  sporadic  neurofibromas  are  from  patients 
who  do  not  meet  the  diagnostic  criteria  for  NF1,  while  the  NF1  neurofibromas  and  MPNSTs 
(malignant  peripheral  nerve  sheath  tumors)  are  from  patients  who  do  meet  the  diagnostic  criteria 
for  NF1.  The  peripheral,  cutaneous  schwannomas  are  from  both  NF2  and  non-NF2  patients. 

Mutation  Analysis. 

A  144bp  fragment  of  the  ERBB2  gene  was  PCR  amplified  from  tumor  DNA  using 
primers  from  Nakamura  et  al  (1994)  (NeuA  5 AG AGCC AGCCCTCTGACGTC ,  NeuB  5’- 
CGTTTCCTGCAGCAGTCTCC)  and  standard  conditions  with  a  58°C  annealing  temperature. 
These  products  were  sequenced  using  standard  cycle-sequencing  protocols  in  our  lab  (ABI 
BigDye  Terminator  Kit)  and  results  analyzed  with  the  SeqEd  vl.0.3  program  (ABI). 

Genetic  Polymorphism  Analysis. 

Genotypes  for  ERBB2  were  determined  with  a  PCR-RFLP  based  assay.  The  primers 
were  designed  to  amplify  the  transmembrane  domain  and  were  based  on  the  published  sequence 
for  the  ERBB2  gene  (ErbB2-lF  5  ’  - AGAGT AGGAGAGGGTCC A AGCC-3 ')  or  taken  from 
Nakamura  et  al  (1994)  as  for  the  mutation  analysis  (NeuA,  NeuB).  The  PCR  amplification  was 
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performed  under  standard  conditions  with  an  annealing  temperature  of  69°C  for  the  ErbB2- 
IF/NeuB  primer  pair  and  58°C  for  the  NeuA/NeuB  primer  pair.  The  PCR  products  were  then 
digested  with  the  restriction  enzyme  BsmBI  (NEB).  The  completely  uncut  product  represents  a 
homozygous  isoleucine  genotype,  while  the  completely  cut  product  represents  the  homozygous 
valine  genotype.  The  digest  products  were  separated  with  8%  native  PAGE  and  stained  with 
ethidium  bromide  for  visualization.  Direct  DNA  sequencing  was  performed  on  some  samples  to 
confirm  the  results  for  the  RFLP  analysis. 

Due  to  the  lack  of  detailed  information  on  patient  ethnicity,  a  well-characterized 
insertion/deletion  polymorphism  in  intron  16  of  the  human  angiotensin  converting  enzyme 
(hACE)  gene  was  used  to  provide  evidence  that  ethnic  background  differences  of  the  NF1  and 
control  populations  were  not  contributing  to  ERBB2  differences.  Genotypes  were  determined 
with  a  PCR  assay  using  primers  from  Lindpaintner  et  al  (1995)  (hACE3-F  5’- 
GCCCTGCAGGTGTCTGCAGCATGT-3’  and  hACE5-R  5’- 

GGATGGCTCTCCCCGCCTTGTCTC-3’).  This  product  was  amplified  under  standard  PCR 
conditions  with  an  annealing  temperature  of  58°C.  This  primer  set  amplifies  both  the  insertion 
and  deletion  alleles,  597  bp  and  319  bp  respectively.  Because  the  deletion  allele  can  be 
preferentially  amplified  in  a  heterozygous  sample,  an  insertion-specific  primer  set  was  used  to 
confirm  the  absence  or  presence  of  this  larger  allele  in  samples  which  were  found  to  be 
homozygous  for  the  deletion  allele  (hACE5-F  5’-TGGGACCACAGCGCCCGCCACTAC-3’ 
and  hACE5-R  5’-TCGCCAGCCCTCCCATGCCCATAA-3’)  (Lindpaintner  et  al.,  1995).  This 
product  was  amplified  with  a  1  min  denaturing  step  at  94°C  and  a  combined  annealing  and 
extension  step  for  1  min  at  72°C,  for  35  cycles.  PCR  products  were  electrophoretically  separated 
on  1%  agarose  gels  and  visualized  with  ethidium  bromide  staining. 
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Statistical  Analysis. 

Statistical  analysis  (odds  ratios  and  95%  confidence  intervals)  was  used  to  determine  a 
possible  correlation  between  ERBB2  genotype  and  NF1.  Odds  ratios  (reported  as  cases/controls) 
are  written  as  the  ratio(95%  confidence  interval).  Hardy- Weinberg  analysis  and  chi-squared 
analysis  for  differences  in  genotype  frequencies  were  completed  as  well.  All  statistics  are 
reported  based  on  the  comparison  of  the  NF1  group  to  each  control  group  separately,  as  well  as 
combined. 

RESULTS 

Ten  sporadic  (non-NFl)  neurofibromas,  9  Schwannomas,  20  NF1  neurofibromas  and  9 
NF1  MPNSTs  were  analyzed  for  the  presence  of  the  ERBB2  codon  659  missense  mutation  in  the 
transmembrane  domain,  and  no  mutations  were  found. 

Figure  1  shows  genotyping  from  the  ErbB2-lF/Neu  B  PCR  product.  All  three  genotypes 
are  represented  and  the  results  matched  those  from  the  direct  sequencing  data.  A  homozygous 
He  genotype  shows  an  uncut  band  at  280  bp  (patients  1  and  5).  The  heterozygous  Ile/Val 
genotype  shows  one  280  bp  band  and  two  lower  bands  of  165  bp  and  1 15  bp  (patients  2,  3,  4). 
The  homozygous  Val  genotype  shows  the  two  lower  bands  only  (patient  6). 

Genetic  analysis  was  completed  on  277  unrelated  NF1  patient  samples  and  152  Cl,  138 
C2,  and  290  Cl  and  C2  combined  samples  (Table  1).  The  Val  allele  was  less  prevalent  among 
NF1  patients  (18%)  than  among  either  Cl  (26%),  C2  (26%)  or  the  combined  controls  (26%). 

The  odds  ratios  for  the  allele  count  are  given  in  Table  1.  The  risk  of  having  NF1  with  the 
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Val/Val  genotype  is  lower  [OR=0.27(0. 12-0.65)]  than  that  with  the  Ile/Ile  genotype 
[OR=0.63(0.44-0.88)]. 

In  addition,  the  number  of  samples  with  a  Val/Val  genotype  is  significantly  lower  in  the 
NF1  population  than  in  the  control  populations  (Cl,  p=0.001;  C2,  p=0.002;  C1+C2,  p=0.002). 

In  contrast,  the  frequencies  for  the  remaining  genotypes  are  not  significantly  different  among  the 
two  populations  (Ile/Ile:  Cl,  p=0.120;  C2,  p=0.086;  C1+C2,  p=0. Ill  andlle/Val:  Cl,  p=0.480; 
C2,  p=0.390;  C1+C2,  p=0.451).  Hardy-Weinberg  equilibrium  analysis  showed  that  there  were 
small  differences  between  the  observed  and  expected  numbers  of  people  in  each  genotype,  but 
the  differences  were  not  statistically  significant  using  chi-squared  tests  for  independence  (NF1, 
p=0.983;  Cl,  p=0.587;  C2,  p=0.822;  C1+C2,  p=0.590). 

The  human  angiotensin  converting  enzyme  ( hACE)  gene  has  a  well-characterized 
insertion/deletion  polymorphism  in  intron  16  for  which  the  genotype  frequencies  vary  based  on 
ethnic  background  (Mathew  et  al.,  2001).  Genotyping  of  the  hACE  gene  in  both  NF1  and  control 
populations  showed  no  statistically  significant  difference  in  genotype  frequencies  between  the 
populations  (Ins/Ins:  Cl,  p=0.870;  C2,  p=0.729;  Cl+2,  p=0.869  and  Ins/Del:  Cl,  p=0.751;  C2, 
p=0.411;  Cl+2,  p=0.671  and  Del/Del:  Cl,  p=0.620;  C2,  p=0.232;  Cl+2,  p=0.533).  Allele 
frequencies  also  showed  no  statistically  significant  difference  (data  not  shown).  This  implies 
that  the  statistically  significant  differences  in  genotype  frequency  seen  for  the  ERBB2  gene  are 
not  a  result  of  ethnic  differences  between  the  groups. 

DISCUSSION 

The  ERBB2  gene  product,  pl85,  is  a  1255  amino  acid  glycoprotein  that  is  closely  related 
to  the  other  members  of  the  EGFR  family  involved  in  signal  transduction.  These  receptors  share 
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a  similar  structure  consisting  of  an  extracellular  ligand-binding  domain,  a  single  hydrophobic 
transmembrane  domain,  and  a  cytoplasmic  tyrosine  kinase  domain.  Alterations  in  the 
transmembrane  segment  may  have  profound  effects  on  the  receptor  activity.  The  ENU-induced 
valine  to  a  glutamic  acid  mutation  at  rat  codon  664  (analogous  to  human  and  hamster  codon  659) 
in  the  transmembrane  domain  is  believed  to  enhance  receptor  dimerization,  which  results  in  the 
activation  of  the  receptor  tyrosine  kinase  without  ligand  binding.  NMR  studies  have  shown  that 
this  activating  mutation  can  cause  significant  intramolecular  rearrangements  which  could 
influence  its  lateral  associations  (Sharpe  et  al.,  2000).  In  addition,  in  vitro  studies  have  shown 
that  activation  of  ERBB2  is  sufficient  to  initiate  the  immortalization  and  transformation  of 
immature  Schwann  cells,  as  well  as  NIH  3T3  cells  (Sherman  et  ah,  1999;  Bargmann  et  ah,  1986). 

A  recent  report  analyzed  a  series  of  spontaneous  peripheral  nerve  sheath  tumors  from 
domesticated  animals  and  found  the  same  ERBB2  mutation  in  75%  of  the  malignant  tumors 
(Stoica  et  al.,  2001).  Although  a  homologous  mutation  has  not  been  found  in  any  human  tumors, 
including  our  sample  set  of  48  peripheral  nerve  sheath  tumors,  there  exists  an  Ile/Val 
polymorphism  nearby  at  codon  655  in  the  transmembrane  domain.  It  is  not  known  if  this 
substitution  results  in  any  change  in  the  tyrosine  kinase  activity.  Nevertheless,  our  study  has 
found  that  the  number  of  NF1  patients  with  a  homozygous  Val  genotype  is  significantly  lower 
than  that  found  in  our  control  populations.  This  finding  suggests  that  the  polymorphism  may  be 
an  NF1  modifier  gene  (or  linked  to  a  modifier).  Interestingly,  the  heterozygous  Ile/Val  genotype 
is  not  significantly  lower  in  the  NF1  population.  One  possible  explanation  is  a  selective 
embryonic  mortality  associated  with  the  combination  of  an  NF1  mutation  and  an  ERBB2 
homozygous  Val  genotype.  Heterozygous  NF1  cells  have  decreased  NF1-GAP  activity  and  thus 
have  increased  signaling  through  the  ras  pathway  (Kim  et  al.,  1995).  This  signaling  cascade 


8 


connects  to  the  MAPK  pathway  through  which  ERBB2  signals.  Perhaps  downstream  alterations 
in  MAPK  signaling  by  a  combination  of  NF1  mutations  and  homozygosity  for  ERBB2  codon  655 
valine  allele  may  predispose  for  a  critical  effect  in  development.  Alternatively,  the  Val 
homozygous  genotype  at  ERBB2  can  attenuate  NF1  phenotype,  such  that  persons  with  an  NF1 
mutation  and  Val/Val  ERBB2  genotype  might  have  a  greater  chance  of  failing  to  display 
sufficient  features  to  be  diagnosed  and/or  brought  to  medical  attention.  However,  of  the  4  (out  of 
7  total)  Val/Val  NF1  patients  for  which  phenotype  information  is  known,  two  each  have  a 
serious  medical  complication;  and  the  other  two  are  only  mildly  affected,  although  the  serious 
complications  might  be  due  to  stochastic  factors.  Thus,  these  data  support  the  notion  that  ERBB2 
genotype  may  have  a  biologically-based  relationship  to  NF1  phenotype  through  mechanisms 
which  bear  further  investigation. 
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Table  1.  ERBB2  allele  frequencies  and  odds  ratios  [OR= 
ratio(95%  Confidence  Interval)]  for  the  NF1  and  control 
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FIGURE  LEGENDS 


Fig  1.  Ethidium-bromide  stained  polyacrylamide  gel  showing  genotyping  of  the  ERBB2 
polymorphism.  The  first  lane  shows  the  lkb  ladder  marker  (Invitrogen).  For  each  sample,  the 
undigested  product  is  shown  first  (U).  The  BsmBI  digested  product  is  shown  second  (C),  with 
the  top  band  (280bp)  representing  the  lie  allele  and  the  bottom  bands  (165bp,  115bp) 
representing  the  cut  Val  allele.  Positive  control  and  patient  6  show  the  Val  homozygous 
genotype. 
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ABSTRACT 


Microdeletions  involving  the  entire  NF1  gene  cause  ~10%  of  neurofibromatosis  1  cases  and  are 
associated  with  an  early  age  of  onset  of  dermal  neurofibromas.  The  majority  are  recurrent  1.5  Mb 
deletions  mediated  by  homologous  recombination  between  ~50  kb  paralogs,  termed  NFIREPs,  that  flank 
the  NF1  gene.  Although  over  25  disorders  result  from  rearrangements  mediated  by  paralogous 
recombination,  little  is  known  about  the  location  and  nature  of  the  breakpoints  at  the  sequence  level. 
Towards  elucidating  the  mechanism(s)  of  microdeletion,  we  sequenced  NFIREP-mediated  NF1 
microdeletions  and  identified  the  breakpoint  intervals  by  use  of  paralogous  specific  variants  (PSV).  We 
identified  2  paralogous  recombination  sites,  termed  PRS1  and  PRS2,  which  are  4,154  and  6,315  bp  in 
length  and  ~15  kb  apart.  Within  each  region,  NF1  microdeletion  breakpoints  were  clustered  with  64% 
(9/14)  of  sequenced  PRS1  breakpoints  in  a  551  bp  hotspot  interval,  while  86%  (25/29)  of  PRS2 
breakpoints  occurred  in  a  2,292  bp  hotspot.  The  breakpoint  frequency  decreased  in  a  generally 
symmetrical  fashion  on  both  sides  of  each  hotspot,  consistent  with  an  initiation  site  for  crossing  over  in 
this  region.  The  pattern  of  PS  Vs  at  one  breakpoint  interval  showed  evidence  of  gene  conversion,  which  is 
consistent  with  the  double-strand  break  repair  model  of  yeast.  We  developed  PRS1  and  PRS2  deletion 
junction-specific  PCR  assays,  screened  a  cohort  of  NF1  deletion  patients,  and  found  that  69%  (N=78)  of 
breakpoints  occurred  at  these  sites;  PRS2-mediated  deletions  were  the  most  prevalent.  Our  results  show 
that  despite  ~50  kb  of  98%  sequence  identity  between  NF1REP  paralogs,  meiotic  recombination 
preferentially  occurs  at  discrete  hotspots. 


INTRODUCTION 


Neurofibromatosis  type  1  (NF1;  OMIM  16220)  is  a  common  autosomal  disorder  that  predisposes 
to  benign  and  malignant  tumor  development  (1-3).  Virtually  all  affected  individuals  develop  multiple 
dermal  neurofibromas,  which  are  benign  tumors  of  the  peripheral  nerve  sheath.  Dermal  neurofibromas 
typically  become  apparent  around  the  time  of  puberty  and  they  increase  in  number  with  age  (4).  Other 
tumors  include  non-dermal  neurofibromas,  optic  nerve  gliomas,  and  malignant  peripheral  nerve  sheath 
tumors.  A  host  of  other  manifestations  can  be  associated  with  NF1  including  cafe  au  lait  spots,  axillary 
and  inguinal  freckling,  Lisch  nodules,  learning  disabilities,  and  distinctive  bony  abnormalities  (1, 5,  6). 
NF1  patients  are  constitutionally  heterozygous  for  an  inactivating  mutation,  which  results  in  functional 
haploinsufficiency  of  the  NF1  gene  product,  neurofibromin,  as  demonstrated  in  humans  by  NF1 
microdeletion  (7,  8)  and  in  mouse  by  targeted  inactivation  (9-11).  Neurofibromin  is  a  tumor  suppressor 
that  functions,  as  least  in  part,  as  a  negative  regulator  of  Ras  (reviewed  in  (12-14). 

Submicroscopic  microdeletion  involving  the  entire  NF1  tumor  suppressor  gene  at  chromosome 
band  17qll.2  accounts  for  an  estimated  5-20%  of  NF1  cases  (15-19).  Although  they  represent  only  a 
fraction  of  causal  mutations,  analysis  of  NF1  microdeletions  is  of  great  interest  for  clinical,  pathological, 
and  molecular  genetic  reasons.  Clinically,  microdeletions  are  the  only  NF1  genotype  known  to  be 
consistently  associated  with  a  particular  phenotype.  Subjects  with  microdeletions  are  remarkable  for  a 
predisposition  to  facial  anomalies  and  childhood  onset  of  dermal  neurofibromas,  or  in  cases  where  age  of 
onset  is  unknown,  an  excessive  number  relative  to  age  (7,  8, 15, 18, 20-24).  A  few  deletion  cases  that  do 
not  show  an  early  age  of  onset  of  dermal  neurofibromas  have  been  reported  (15, 18,  24),  however,  since 
the  extent  of  the  deletions  was  not  delineated,  it  is  unclear  whether  the  same  loci  are  involved.  In 
contrast,  about  80%  of  NF1  cases  are  caused  by  subtle  intragenic  private  mutations  that  predict  truncation 
of  neurofibromin  (19,  25).  Among  these  subjects,  no  correlation  has  been  detected  between  mutation  type 
and/or  location  and  the  development  of  specific  manifestations  (26-28). 

Pathologically,  NF1  microdeletions  are  important  because  they  hold  the  promise  of  identifying  a 
modifying  gene  that  potentiates  dermal  neurofibromagenesis.  The  correlation  between  microdeletion  and 
the  early  onset  tumor  phenotype  led  us  to  hypothesize  that  dermal  neurofibromas  developed  earlier  due  to 
haploinsufficiency  for  both  neurofibromin  and  the  product  of  an  unknown  closely  linked  gene,  designated 
as  NPL  (neurofibroma-potentiating  locus)(8,  23,  29,  30).  Two  models  have  been  proposed  for  how  NPL 
functions  to  potentiate  dermal  neurofibroma  development  (29).  Haploinsufficiency  for  NPL  could 
increase  the  frequency  of  somatic  second-hit  mutations  in  the  NF1  gene.  Homozygous  inactivation  of  the 
NF1  gene  has  been  demonstrated  in  dermal  neurofibromas  (31-34).  A  possible  mechanism  may  be 
genomic  instability  in  microdeletion  patients,  which  is  intriguing  in  view  of  reports  detecting  cytogenetic 
abnormalities  and  microsatellite  instability  in  some  neurofibromas  (35, 36).  A  second  model  proposes 
that  NPL  haploinsufficiency  increases  the  probability  that  a  neurofibromin-deficient  progenitor  Schwann 
cell  will  manifest  as  a  dermal  neurofibroma.  For  example,  NPL  could  encode  (or  regulate)  a  cytokine, 
cell  cycle  regulator,  tumor  suppressor,  or  oncogene  that  exerts  a  positive  proliferative  advantage  on  a 
neurofibromin-deficient  cell. 

Molecularly,  NF1  microdeletions  in  humans  are  important  because  they  typically  arise  by 
paralogous  recombination  at  specific  sites  (30).  We  recently  proposed  using  the  term  paralogous 
recombination  (29)  to  describe  homologous  recombination  between  paralogs,  which  has  also  been 
referred  to  as  nonallelic  homologous  recombination  (37)  or  ectopic  homologous  recombination  (38). 
Paralogs,  also  known  in  the  human  disease  literature  as  low-copy  repeats  (LCR)  (39,  40),  duplicons  (41), 
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or  repeats  (REPs)  (42, 43),  are  DNA  sequences  of  high  identity  that  derive  from  duplication  of  a  gene  or 
segment  within  a  genome  (44, 45).  An  estimated  80%  of  NF1  microdeletions  arise  by  paralogous 
recombination  between  the  NF1REP-P  and  NF1REP-M  paralogs  that  flank  the  NF1  gene  resulting  in 
deletions  about  1.5  Mb  in  length  (30, 46).  NF1  microdeletions  can  be  familial  or  de  novo  (8, 21, 23,  30, 
47).  New  microdeletions  preferentially  occur  on  the  maternally-derived  allele  (17, 48)  and  detection  of 
crossovers  flanking  the  deleted  region  is  consistent  with  a  mechanism  of  unequal  crossing  over  between 
misaligned  NFIREPs  during  maternal  meiosis  I  (48).  These  NF1REP  elements  are  51  kb  in  length  (S.H. 
Forbes  et  al.,  unpublished  data)  and  the  microdeletion  region  harbors  at  least  11  genes  in  addition  to  the 
350  kb  NF1  gene  (30, 49).  Which  gene  may  represent  the  NPL  locus  is  unknown,  as  the  functions  of 
these  genes  remain  to  be  determined. 

Previously,  we  described  a  2  kb  hotspot  where  paralogous  recombination  between  NF1REP-P1 
and  NF1REP-M  resulted  in  a  recurrent  1.5  Mb  NF1  microdeletion  (20).  Forty-six  percent  of  patients 
(N=54),  known  to  be  hemizygous  for  at  least  the  NF1  gene,  had  breakpoints  clustered  at  this  hotspot. 
Towards  elucidating  the  mechanism  of  NFIREP-mediated  NF1  microdeletions  and  narrowing  the  critical 
region  of  the  NPL  gene,  we  mapped  and  sequenced  additional  breakpoints.  Here  we  report  the 
identification  a  second  distinct  paralogous  recombination  hotspot,  extension  and  reginement  of  the  initial 
hotspot  region,  and  the  development  of  deletion  junction-specific  PCR  assays  that  detected  69%  of  NF1 
microdeletions. 

RESULTS 

Identification  and  amplification  of  deletion  breakpoints 

In  the  course  of  characterizing  the  initial  2  kb  paralogous  recombination  hotspot  (20),  we  found  that 
breakpoints  in  6  microdeletion  cases  were  not  located  at  the  hotspot  but  had  previously  been  mapped 
within  the  NF1REP  paralogs  (30).  This  suggested  the  possibility  of  additional  recombination  hotspots 
within  the  paralogs.  We  identified  and  sequenced  the  breakpoints  of  these  6  cases  (UWA1 19-1, 
UWA123-1,  UWA166-2,  UWA169-1,  UWA172-1,  and  UWA176-1),  which  have  been  described 
previously  (8, 23,  30).  Because  the  NF1REP  paralogs  share  98%  sequence  identity  (30)  it  was  necessary 
to  identify  paralogous  sequence  variants  (PSV)  (50),  i.e.,  nucleotide  variants  specific  for  NF1REP-P1  and 
NF1REP-M,  to  determine  the  breakpoint  interval  in  the  recombinant  NF1REP.  Because  of  the  recent 
identification  of  an  additional  NF1REP  proximal  to  the  NF1  gene  (Dieter  Jenne,  et  al.,  unpublished  data; 
Stephen  Forbes,  et  al.,  unpublished  data),  the  previously  named  NF1REP-P  (30)  is  now  referred  to  as 
NF1REP-P1  (Figure  1).  We  employed  our  previously  successful  strategy  (20)  of  designing  primers  from 
the  known  sequence  of  NF1REP-P1  and  amplifying  that  segment  from  a  somatic  cell  hybrid  line  carrying 
only  the  patient’s  chromosome  17  with  the  deletion.  Direct  sequencing  revealed  whether  the  pattern  of 
PS  Vs  was  representative  of  NF1REP-P1  or  NF1REP-M,  or  if  it  transitioned  from  NF1REP-P1  to 
NF1REP-M  as  expected  if  the  amplicon  contained  the  breakpoint  of  the  recombinant  NF1REP. 
Simultaneously,  BAC  271K1 1  carrying  NF1REP-P1  and  BACs  640N20  and  951F1 1  carrying  NF1REP- 
M  (30)  were  also  amplified  and  sequenced.  Amplification  of  BACs  271K1 1  and  640N20  served  as 
positive  controls,  while  amplification  of  BAC  951F1 1  provided  sequences  not  completed  by  the  Human 
Genome  Mapping  Project  at  that  time.  Analysis  of  these  data  identified  two  clusters  of  breakpoints.  The 
first  cluster  was  4,154  bp  in  length  and  designated  as  paralogous  recombination  site  1  (PRSl)(Figure  1). 
The  second  cluster  was  6,315  bp  in  length  and  extended  the  cluster  harboring  the  previously  defined  2  kb 
recombination  hotspot.  This  cluster,  designated  as  paralogous  recombination  site  2  (PRS2),  is  about  15 
kb  telomeric  to  PRS1  (Figure  1). 
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NF1  microdeletion  junction-specific  PCR  assays  were  developed  by  judicious  primer  design  that  took 
advantage  of  PSVs.  Figure  2A  shows  the  results  of  two  different  PRS1  assays.  The  assay  with  primers 
mdf4-F  and  md30-R  was  robust  and  gave  a  3  kb  junction  fragment.  A  longer  7  kb  amplicon  of  the 
deletion  junction  generated  with  primers  md4-F  and  md4-R  detected  all  known  breakpoints  in  the  PRS1 
region  (Figures  2A,  2B,  and  3  A).  Occasionally,  the  7  kb  PRS1  assay  showed  a  smear  of  products,  which 
should  not  be  misinterpreted  as  a  positive  result  (Figure  2B).  In  this  instance,  the  assay  of  normal 
genomic  DNA  shows  a  smear  of  products  without  the  intense  7  kb  junction  fragment  (UWA183-1). 
Confirmation  that  this  result  is  a  true  negative  is  shown  by  absence  of  the  7  kb  junction  fragment  from 
genomic  DNA  of  a  somatic  cell  hybrid  line  (UWA183-1-H#15)  carrying  only  the  PRS2-mediated  deleted 
chromosome  17  of  patient  UWA1 83-1  (Figure  2B).  We  also  verified  that  this  pattern  is  not  due  to 
amplification  of  NFIREP-like  sequences  on  chromosome  19  (Figure  2B).  Although  it  is  unclear  why 
some  genomic  DNA  samples  give  this  diffuse  pattern  of  amplicons,  nonspecific  products  from  both  PRS1 
and  PRS2  can  be  minimized  by  the  use  of  high  purity  DNA  (see  Materials  and  Methods). 

Our  previously  reported  deletion  junction-specific  assay  for  breakpoints  at  the  2  kb  recombination 
hotspot  utilized  primers  DCF  and  DTR,  which  gave  a  3.4  kb  amplicon  (20).  For  this  assay,  instead  of  die 
DCF  primer,  we  now  routinely  use  md21-F,  whose  one  additional  base  at  the  5’  end  made  the  assay  more 
reliable  and  robust  (Figure  2C,  Table  1).  A  new  assay  was  needed  to  amplify  the  extended  length  of  the 
breakpoint  cluster  (now  designated  as  PRS2).  The  new  assay  utilized  primers  md21-F  and  md7-R  and 
amplified  a  7  kb  junction  fragment  (Figure  2C,  Table  1).  While  this  assay  detects  a  greater  number  of 
deletion  breakpoints,  we  have  found  that  it  can  be  less  robust  and  more  sensitive  to  the  purity  of  the 
genomic  DNA  template  compared  to  the  smaller  amplicon  assays  (see  Materials  and  Methods). 

NF1  microdeletion  breakpoint  sequence  analyses 

To  identify  the  precise  breakpoint  intervals  within  PRS1  and  PRS2,  deletion  junction-specific 
amplicons  (PRS1  7  kb  and  PRS2  7  kb)  were  sequenced  directly  using  a  series  of  internal  primers.  To 
facilitate  differentiating  PSVs  of  NF1REP-P1  and  NF1REP-M  from  allelic  polymorphisms  (e.g.,  single 
nucleotide  polymorphisms,  SNP),  we  also  amplified,  cloned,  and  sequenced  nonrecombinant  PRS1  and 
PRS2  regions  from  both  NF1REP-P1  and  NF1REP-M  of  8  normal  chromosomes  and  of  NF1REP-M 
carried  in  BAC  951F11. 

Sequenced  breakpoints  at  PRS1.  The  sequences  of  14  breakpoints  defined  the  PRS1  breakpoint 
cluster  as  4,154  bp  in  length  (Figure  3  A).  Breakpoint  distribution  was  skewed  with  64%  mapping  to  a 
551  bp  hotspot.  The  remaining  breakpoints  occurred  in  unique  sequence  intervals.  The  shorter  3  kb 
PRS1  amplicon  assay  (primers  mdr4-F  and  md30-R)  detected  13/14  of  the  breakpoints  (Figure  3A, 
breakpoints  between  122,140  and  1 24,22 l)(Table  2).  The  sequence  of  the  170  bp  breakpoint  interval  for 
patient  UWA123-3  was  consistent  with  a  gene  conversion  event  (Figure  3B).  Since  the  patient’s 
microdeletion  was  paternal  in  origin,  we  cloned  and  sequenced  both  alleles  of  NF1REP-P1  and  NF1REP- 
M  of  the  father.  These  data  identified  the  paternal  NF1REP-P1  and  NF1REP-M  alleles  that  served  as  the 
recombination  substrates  and  confirmed  the  alternating  pattern  of  PSV  and  SNP  between  NF1REP-P1  and 
NF1REP-M  across  the  breakpoint  region  (Figure  3B). 

Sequenced  breakpoints  at  PRS2.  The  sequences  of  29  breakpoints,  including  25  previously 
reported  (20)  that  are  summarized  in  Figure  3C,  defined  the  PRS2  breakpoint  cluster  as  6,315  bp  in 
length.  Breakpoint  distribution  was  skewed  with  86%  (25/29)  mapping  to  a  2,292  bp  hotspot.  The 
previously  reported  3.4  kb  PRS2  assay  (primers  DCF  and  DTR)  (20)  or  the  modified  md21-F/deljunl-R 
assay  (Figure  2C),  detected  25/29  cases  (Figure  3B,  from  141,992  to  144,299  bp).  For  the  cases  whose 
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breakpoint  intervals  were  published  previously  (20),  it  should  be  noted  that  some  intervals  have  increased 
in  length.  Here  we  included  the  sequences  of  7  NF1REP-P1  and  NF1REP-M  alleles  from  control 
individuals,  along  with  sequenced  BACs,  to  differentiate  between  common  SNPs  and  PSV.  The 
identification  of  base  142,662  as  a  SNP  resulted  in  extending  the  intervals  of  14  patients  to  the  next  PSV 
at  142,728.  Similarly  the  identification  of  143,981  and  143,998  as  SNPs  extended  the  interval  of  7 
patients  to  the  next  PSV  at  144,299.  The  originally  reported  17  bp  recombination  interval  for  patient  99- 
1(20)  was  confirmed  by  sequence  analysis  of  the  maternal  progenitor  alleles  that  served  as  recombination 
substrates,  thereby  documenting  that  they  carried  the  SNPs  as  shown  (Figure  3C). 

Frequency  of  PRS1-  and  PRS2-mediated  A/F7  microdeletions 

To  study  a  larger  cohort  of  NF1  deletion  patients  and  determine  the  frequency  of  paralogous 
recombination  events  at  PRS1  and  PRS2,  we  performed  deletion  junction-specific  assays  on  additional 
patients  known  to  be  hemizygous  for  all,  or  nearly  all,  of  the  NF1  gene.  A  total  of  78  patients  were 
studied,  including  those  whose  breakpoints  were  sequenced  (Figure  3).  These  analyses  showed  that  14/78 
(18%)  of  deletions  occurred  at  PRS1,  while  40/78  (51%)  occurred  at  PRS2  (Table  2).  The  more  robust, 
shorter  amplification  assays  detected  the  majority  of  cases.  Primers  md4-F  and  md30-R  detected  13/14  of 
the  total  PRS1  recombination  events  and  primers  md21F  and  deljunl-R  (or  DCF  and  DTR  (20)  detected 
33/40  (82%)  of  the  total  PRS2  recombination  events  (Table  2).  Neither  the  7  kb  PRS1  nor  the  7  kb  PRS2 
deletion-junction  assays  generated  products  upon  amplification  of  genomic  DNA  from  the  peripheral 
blood  of  75  control  individuals. 

There  was  no  significant  difference  between  PRS1-  and  PRS2-mediated  microdeletions  for  the 
parent  of  origin  or  for  de  novo  versus  familial  cases  (Figure  3)(20).  In  our  series  of  59  cases  where  we 
have  clinical  data  for  the  parents,  6  (10%)  inherited  the  NF1  microdeletion.  Among  45  de  novo  cases 
where  parental  origin  of  the  microdeletion  was  determined,  36/45  (80%)  were  maternally  derived,  while 
9/45  (20%)  were  paternally  derived. 


DISCUSSION 

By  employing  sequence  analysis  of  deletion  junctions  and  deletion  junction-specific  PCR  assays, 
we  found  that  the  paralogous  recombination  events  mediating  NF1  microdeletions  occurred  preferentially 
at  two  discrete  sites.  Sequence  analyses  of  deletion  junction  fragments  of  recombinant  NFIREPs,  along 
with  the  comparable  region  of  nonrecombinant  NF1REP-P1  and  NF1REP-M  in  control  individuals, 
identified  PSVs  that  were  used  to  delimit  the  paralogous  recombination  sites  PRS1  and  PRS2  to  4,154  bp 
and  6,315  bp,  respectively.  Together,  recombination  at  these  two  distinct  sites  accounted  for  69%  of  NF1 
microdeletion  cases  (N=78)  and  of  these  74%  occurred  at  PRS2.  Within  each  PRS  region,  NF1 
microdeletion  breakpoints  were  clustered  (Figure  3).  Sixty-four  percent  (9/14)  of  sequenced  PRS1 
breakpoints  occurred  in  a  551  bp  interval,  while  86%  (25/29)  of  sequenced  PRS2  breakpoints  occurred  in 
a  2,292  bp  interval.  The  breakpoint  frequency  decreased  in  a  generally  symmetrical  fashion  on  both  sides 
of  the  PRS1  and  PRS2  hotspots,  which  is  consistent  with  an  initiation  site  for  crossing  over  in  this  region 
as  predicted  by  the  double-strand  break  repair  model  (51)  and  similar  to  maps  of  strand  exchange  at  other 
recombination  hotspots  in  mammals  (52,  53).  Both  familial  and  de  novo  microdeletions  occurred  at  both 
PRSs.  Eighty  percent  (N=45)  of  de  novo  NF1  microdeletions  occurred  on  the  maternally-derived 
chromosome  17  (this  manuscript;  (20),  which  is  the  same  frequency  reported  in  a  previous  smaller  study 
(17, 48).  However,  both  maternally-  and  paternally-derived  microdeletions  occurred  at  both  PRS1  and 
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PRS2.  The  work  of  Jenne  et  al.  (54)  provides  independent  confirmation  of  breakpoint  hotspots  at  both  of 
the  PRS  intervals.  Of  4  reported  microdeletion  cases,  one  had  a  breakpoint  in  the  PRS1  region  and  3  had 
breakpoints  in  the  PRS2  region. 

Sequence  analysis  of  the  PRS1  deletion  junction  fragment  of  patient  UWA123-3  revealed  a  complex 
pattern  of  alternating  NF1REP  sequences  from  NF1REP-P1/-  M/-  PI/-  M  (Figure  3B).  This  sequence 
pattern  was  not  due  to  polymorphism(s)  in  this  individual  as  demonstrated  by  sequence  analysis  of  the 
nonrecombinant  NFIREPs  of  the  transmitting  parent.  These  data  are  consistent  with  gene  conversion 
during  double  strand  break  repair  as  described  in  yeast  (51,  55).  We  previously  described  similar  gene 
conversion  events  at  PRS2  for  two  NF1  microdeletion  patients  (20).  The  gene  conversion  tracts  in  the 
NFIREPs  are  relatively  short  (<670  bp),  which  is  consistent  with  the  typical  <2  kb  conversion  tracts  in 
yeast  (56, 57)  and  humans  (53,  58-62).  Sequence  analysis  of  other  NF1  microdeletion  patients  showed 
apparent  gene  conversion  tracts,  but  DNA  of  the  transmitting  parent  was  unavailable  for  confirmation. 
Therefore,  the  frequency  of  gene  conversion  events  in  our  series  of  NF1  microdeletions  is  likely 
underestimated. 

While  the  majority  of  NF1  microdeletion  breakpoints  occurred  in  intervals  of  >500  bp  in  length,  we 
were  surprised  that  several  breakpoints  occurred  in  small  intervals,  in  particular,  the  PRS2  deletion 
breakpoint  of  patient  99-1,  which  mapped  to  a  17  bp  interval  (Figure  3)(20).  In  this  case,  sequence 
analysis  of  the  nonrecombinant  NF1REP-P1  and  NF1REP-M  progenitor  alleles  of  the  transmitting  parent 
confirmed  the  interval  length  by  documenting  the  specific  nucleotides  at  the  polymorphic  sites  143,981 
and  143,998  as  shown  in  Figure  3C.  Several  other  NF1  microdeletion  breakpoints  mapped  to  intervals 
<300  bp  (Figures  3A,  C)  but  the  transmitting  parent  was  not  available  for  confirmation.  Studies  in  model 
organisms  support  a  relationship  between  the  length  of  perfect  uninterrupted  stretches  of  homology  and 
the  frequency  of  homologous  recombination  (57, 63-66).  Previous  data  suggest  that  the  length  of 
uninterrupted  homology  for  efficient  recombination  (minimal  efficient  processing  segment,  MEPS)  in 
humans  may  be  337-456  bp  (58,  67).  Mismatches  flanking  the  17  bp  recombination  interval  would  limit 
the  length  of  contiguous  perfect  homology  between  NF1REP-P1  and  NF1REP-M  and  most  likely  reduce 
the  recombination  frequency  in  this  small  interval.  Alternatively,  the  recombination  event  in  the  99-1 
case  may  have  originated  within  either  the  950  bp  or  310  bp  of  contiguous  homology  that  flank  the  17  bp 
breakpoint  interval  (Figure  3C),  propagated  through  the  heterology  at  one  border  of  the  interval,  and 
terminated  within  the  17  bp  segment.  It  has  been  shown  in  mouse  L  cells  that  recombination  events 
initiated  in  large  intervals  of  uninterrupted  homology,  can  propagate  and  terminate  in  adjacent  regions 
that  have  significant  mismatches  (68).  Furthermore,  in  humans  homologous  unequal  crossovers  in  vivo 
have  been  reported  in  small  intervals,  e.g.,  58  bp  at  the  (3-globin  cluster  (69)  and  26  bp  between  Alu 
repeats  (70). 

The  development  of  PCR-based  deletion  junction  assays  that  detect  the  majority  of  microdeletions 
spanning  the  entire  NF1  gene  will  facilitate  the  identification  of  additional  patients  for  both  molecular  and 
clinical  studies.  Importantly,  we  will  now  be  able  to  identify  cohorts  of  microdeletion  patients  with  the 
same  genotype  to  determine  the  spectrum  of  clinical  manifestations  associated  with  the  recurrent  PRS1 
and  PRS2  microdeletions.  For  purposes  of  phenotype/genotype  correlations,  cases  with  breakpoints  at 
PRS1  and  PRS2  may  have  virtually  the  same  genotype.  These  recombination  sites  are  about  15  kb  apart 
and  no  genes  are  known  to  map  in  this  interval  (30, 49). 

Currently,  genetic  diagnostic  laboratories  offer  clinical  and  prenatal  testing  for  NF1  microdeletion  by 
fluorescence  in  situ  hybridization  with  an  NF1  probe(s)  (http://www.genetests.org)(22, 47).  This  test  may 
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be  augmented  or  replaced  by  the  PRS1  and  PRS2  deletion  junction-specific  assays  described  here.  This 
would  reduce  costs,  increase  specificity,  and  still  be  reasonably  (but  less)  sensitive.  One  potential 
problem  in  using  PRS1  and  PRS2  deletion-specific  assays  in  a  clinical  setting  would  involve  the 
possibility  of  false  positives  due  to  a  somatic  NF1  microdeletion  that  results  in  low-level  mosaicism  in 
white  blood  cells  (20).  Our  negative  test  results  for  1 50  normal  chromosomes  indicate  that  this  is  not  a 
common  phenomenon  in  white  blood  cells  or  their  precursors.  Furthermore,  application  of  the  PRS1  and 
PRS2  deletion  junction  assays  to  the  healthy  parents  of  12  confirmed  de  novo  microdeletion  cases 
described  here  and  elsewhere  (20)  failed  to  detect  a  mosaic  parent.  An  affected  child  with  a  de  novo 
microdeletion  and  a  mosaic  parent  has  been  described  (24),  but  it  is  not  known  if  the  breakpoint  occurred 
at  either  PRS. 

Although  greater  than  25  single  gene  or  chromosomal  disorders  are  caused  by  rearrangements 
mediated  by  recombination  between  repeats,  referred  to  as  genomic  disorders  (37, 71)  or  paralogous 
recombination  disorders  (29),  breakpoints  have  been  sequenced  in  relatively  few.  Evidence  of  apparent 
gene  conversion  in  the  peri-breakpoint  regions  of  NF1,  CMT1A,  AZFa,  and  IDS  rearrangements  (20,  58, 
59,  72-74)  are  consistent  with  a  mechanism  of  double  strand  break  repair  (DSB)  as  described  in  yeast  (72) 
and  recently  reviewed  in  mammals  and  other  eukaryotic  cells  (75, 76). 

Our  identification  of  PRS1  and  PRS2  as  preferential  breakpoint  sites  within  the  surrounding  51  kb 
NF1REP  region  of  98%  identity  implies  that  there  is  an  additional  factor(s),  such  as  a  sequence  motif, 
functional  domain,  or  higher  order  chromosomal  feature,  that  influences  the  site  of  the  DSB  and/or  the 
site  of  strand  exchange,  which  can  be  distinct  from  the  DSB  site  due  to  migration  of  a  Holliday  junction. 
Among  known  NFIREP-mediated  microdeletions,  there  is  a  significant  preference  for  recombination  at 
PRSs  versus  the  remaining  NF1REP  sequence.  A  previous  study  provides  an  estimate  of  the  percent  of 
NF1  microdeletions  with  breakpoints  in  NF1REP-P1  and  -M  (30).  From  a  series  of  17  independent 
microdeletions,  we  determined  that  14/17  microdeletions  had  breakpoints  in  these  NFIREPs.  In  this 
study,  sequence  analyses  determined  that  13  of  these  occurred  in  either  PRS1  or  PRS2  (20),  while  only  1 
occurred  in  another  region  of  the  NF1REP  (30).  The  preference  for  recombination  in  either  PRS1  or 
PRS2  together  (10.4  kb  region)  versus  the  remainder  of  the  NF1REP  paralogs  (51  kb)  is  63:1  as 
determined  by  calculation  of  the  relative  risk.  This  is  comparable  to  the  preference  for  paralogous 
recombination  at  the  hotspot  that  mediates  the  1.4  Mb  duplication  and  1.4  Mb  deletion  that  cause 
Charcot-Marie-Tooth  type  1 A  (CMT1  A)  and  hereditary  neuropathy  with  liability  to  pressure  palsies, 
respectively  (reviewed  in  (77).  Despite  98.7%  sequence  identity  between  the  two  24  kb  CMT1AREP 
paralogs  (78),  about  75%  (N=24)  of  crossovers  occur  in  a  1.7  kb  hotspot,  giving  a  relative  risk  of  about 
50: 1  (58, 72).  Breakpoint  hotspots  have  also  been  defined  for  AZFb  and  AZFc  deletions  in  men  with 
spermatogenic  failure  (38).  In  contrast  to  NF1  and  CMT1A,  these  recombination  events  occurred  between 
massive  palindromic  sequences,  which  are  known  to  be  sites  of  preferential  double  strand  breaks  in  vitro 
(79).  A  mariner  insect  transposon  element  (MITE),  located  about  1  kb  from  the  CMT1 A  hotspot,  has 
been  proposed  as  a  preferred  site  of  double  strand  breaks  that  may  initiate  recombination  (80,  81).  A 
search  for  known  recombinogenic  motifs  and  replication-associated  sequences  near  the  2  kb 
recombination  hotspot  (short  PRS2  deletion  junction  assay)  for  the  NF1  microdeletion  identified  a  nearby 
%-like  element  (20),  which  is  known  to  stimulate  recombination  in  E.  coli  (reviewed  in  (82).  However,  no 
%-like  element  was  in  proximity  of  PRS  1 .  Additionally,  neither  palindromes  nor  MITEs  were  found  in 
proximity  to  either  PRS  1  or  PRS2.  However,  a  complete  analysis  of  PRS  1  and  PRS2  awaits  the  full 
sequence  and  structure  analysis  of  NF1REP-P1  and  NF1REP-M. 
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MATERIALS  AND  METHODS 

Subjects  and  cell  lines 

Peripheral  blood  samples  were  obtained  after  informed  consent  from  NF1  microdeletion  patients 
and  their  parents,  when  available.  The  presence  of  an  NF1  microdeletion  has  been  reported  previously  in 
most  of  the  patients  (15-17,  30,  83).  In  all  cases,  the  NF1  microdeletion  was  known  to  extend  beyond  the 
borders  of  the  gene.  Rodent/human  somatic  cell  hybrid  lines  were  constructed  for  a  subset  of  the  patients 
(30)  to  facilitate  breakpoint  mapping  and  sequence  analyses  of  the  deleted  chromosome  17  homolog. 

Identification  of  NF1  breakpoints 

Breakpoints  were  mapped  in  somatic  cell  hybrids  containing  the  chromosome  17  homolog  that 
carried  the  NF1  microdeletion  (8,  30)  by  direct  sequencing  using  the  strategy  described  previously  (30). 

Nonrecombinant  PRS1  and  PRS2  regions  from  normal  individuals  were  sequenced  from 
NF1REP-P1-  and  NFIREP-M-specific  amplicons  and  from  BACs  known  to  map  at  either  NF1REP.  The 
nonrecombinant  PRSl  region  of  NF1REP-P1  was  amplified  with  primers  md4-F  and  126Kproxl-R  and 
the  nonrecombinant  PRSl  region  of  NF1REP-M  was  amplified  with  primers  md31M-F  and  md4-R  (Table 
1).  Both  reactions  were  performed  in  50  pi  reaction  volume  using  the  Expand  Long  Template  PCR 
system  (Roche,  Indianapolis)  containing  200-300  ng  genomic  DNA,  5  pi  of  10X  buffer  #1,  3.75  U 
polymerase,  350  pM  each  dNTP,  and  0.3  pM  each  primer.  After  heating  to  95°C  for  5  min,  35cycles  of 
amplification  were  performed  at  95°C  for  30  s,  62°C  for  30  s  for  NF1REP-P1  or  62°C  for  30  s  for 
NF1REP-M,  and  68°C  for  5  min,  with  a  final  extension  of  68°C  for  15  min.  Amplicons  were  column 
purified  (Qiagen,  Valencia,  CA,  cat#28706)  and  cloned  into  the  TOPO  XL  PCR  cloning  kit  (Invitrogen, 
Carlsbad,  CA).  To  confirm  presence  of  insert,  clones  were  amplified  with  md32-F  and  md32-R  in  a  50  pi 
reaction  containing  2  pi  of  overnight  culture  of  E.  coli  in  LB,  5  pi  1  OX  Buffer  #11  (Gene Amp,  Applied 
Biosystems,  Foster  City,  CA),  200  pM  each  dNTP,  1,5  mM  MgCl2,  0.3  pM  each  primer,  and  2.5  U 
TaqGold  polymerase.  After  an  initial  incubation  at  95°C  for  10  min,  35  cycles  of  amplification  were 
performed  at  95°C  for  1  min,  60°C  for  lmin,  and  72°C  for  1  min,  with  a  final  extension  at  72°C  for  8  min. 
Plasmid  preparations  of  clones  were  column  purified  (Qiagen,  Cat#27106)  and  sequenced  directly  using 
internal  primers  as  described  below. 

The  nonrecombinant  PRS2  region  of  NF1REP-P1  was  amplified  with  primers  mddjlb-F  and 
P148prx-R  using  the  Expand  Long  Template  PCR  system  in  a  50  pi  reaction  volume  containing  200-300 
ng  genomic  DNA,  5  pi  10X  Buffer  #3,  500  pM  each  dNTP,  0.3  pM  each  primer,  and  3.75  U  polymerase. 
After  an  initial  incubation  at  95°C  for  5  min,  35  cycles  of  amplification  were  performed  at  95°C  for  30  s, 
64°C  for  30  s,  and  68°C  for  5  min,  with  a  final  extension  of  68°C  for  15  min.  The  nonrecombinant  PRS2 
region  of  NF1REP-M  was  amplified  with  primers  4400S4-F  and  P148MED-R  using  the  Expand  Long 
Template  PCR  system  in  a  50  pi  reaction  volume  containing  the  same  components  as  the  above  reaction 
for  NF1REP-P1  amplification.  After  an  initial  incubation  at  95°C  for  5  min,  35  cycles  of  amplification 
were  performed  at  95°C  for  30  sec,  62°C  for  30  sec,  and  68°C  for  5  min,  with  a  final  extension  of  68°C 
for  15  min.  Amplicons  were  column  purified  (Qiagen,  Cat#28706)  and  cloned  into  the  TOPO  XL  PCR 
cloning  kit  (Invitrogen).  To  confirm  presence  of  insert,  clones  were  amplified  with  RB4-F  and  RB4-R  in 
a  50  pi  reaction  containing  2  pi  of  overnight  culture  of  E.coli  in  LB,  5  pi  1  OX  Buffer  #11  (GeneAmp, 
Applied  Biosystems),  200  pM  each  dNTP,  1.5  mM  MgCl2,  0.3  pM  each  primer,  and  2.5  U  TaqGold 
polymerase.  After  an  initial  incubation  at  95  °C  for  10  min,  35  cycles  of  amplification  were  performed  at 


9 


» 


95°C  for  1  min,  60°C  for  lmin,  and  72°C  for  1  min,  with  a  final  extension  at  72°C  for  8  min.  Plasmid 
preparations  of  clones  were  purified  (Qiagen,  Cat#27106)  and  sequenced  directly  using  internal  primers 
as  described  below. 

All  amplifications  used  a  mixture  of  dNTPs  (Roche  Cat#l  58 1295)  and  were  performed  in  a  MJ 
Research  Cycler  (model  PTC-200  or  PTC- 100).  Amplicons  were  sequenced  by  cycle  sequencing  using 
either  the  Big-Dye  Terminator  or  the  SequiTherm  EXCELII  Long  Read  (Epicentre,  Madison,  WI)  kits. 
Extension  products  were  analyzed  an  either  an  ABI  377  sequencer  (Applied  Biosystems)  or  an  A.L.F. 
sequencer  (Pharmacia,  Uppsala,  Sweden).  Raw  nucleotide  sequences  were  analyzed  with  Sequencher 
(GeneCodes,  Ann  Arbor,  MI). 

PRS1  and  PRS2  deletion  junction  fragment  assays 

The  3  kb  PRS1  deletion  junction  fragment  assay  utilized  primers  md4-F  and  md30-R  and  the  7  kb 
PRS1  assay  utilized  primers  md4-F  and  md4-R  (Table  1).  Assay  conditions  for  both  were  a  50  pi  reaction 
volume  using  the  Expand  Long  Template  PCR  System  (Roche  Molecular  Systems,  Indianapolis,  IN)  with 
200-300  ng  genomic  DNA,  0.3  pM  each  primer,  350  pM  each  dNTP,  5  pi  1  OX  PCR  buffer  #1,  and  3.75 
U  polymerase.  After  heating  to  95°C  for  5  min,  35  cycles  of  amplification  were  performed  at  95°C  for  30 
s,  64°C  for  30  s,  and  68°C  for  5  min,  with  a  final  extension  of  68°C  for  1 5  min. 

The  3.4  kb  PRS2  assay  utilized  primers  md21-F  and  deljunl-R  (Table  1)  (previously  reported  as 
DCF  and  DTR  (20).  Here  we  report  an  alternate  ~3.4  kb  PRS2  assay  with  forward  primer  mddj  lc,  which 
is  identical  to  md21-F  except  for  an  additional  T  at  the  5’  end.  The  7  kb  PRS2  assay  was  performed  with 
primers  md21-F  and  md7-R.  The  conditions  for  all  three  assays  were  a  50  pi  reaction  volume  using  the 
Expand  Long  Template  PCR  System  (Roche  Molecular  Systems,  Indianapolis,  IN)  with  200-300  ng 
DNA,  0.15  pM  each  primer,  350  pM  each  dNTP,  5  pi  10X  PCR  buffer  #1,  and  3.75  U  polymerase.  After 
heating  to  95°C  for  5  min,  35  cycles  of  amplification  were  performed  at  95°C  for  30  s,  68°C  for  5  s,  and 
68°C  for  5  min,  with  a  final  extension  of  68°C  for  15.  In  addition,  the  PRS2  7  kb  assay  had  250  mM 
betaine  (Sigma,  BO300). 

All  primers  designed  for  amplification  of  NF1REP  sequences  were  based  on  homology  or 
heterology,  between  NF1REP-P1,  NF1REP-M,  and  another  NF1REP-E 19  at  chromosome  19p  on  BACs 
548K16  and  2189E23.  All  amplifications  used  a  mixture  of  dNTPs  (Roche,  Cat#1581295)  and  were 
performed  in  a  MJ  Research  Cycler  (South  San  Francisco,  CA,  model  PTC-200  or  PTC- 100).  DNA  must 
be  of  high  quality  to  avoid  non-specific  products  in  the  PRS1  and  PRS2  assays.  We  routinely  used  the 
Puregene  DNA  Isolation  Kit  (Gentra  Systems,  Minneapolis,  MN,  Cat.  D-5000)  to  purify  DNA  from 
peripheral  blood  and  immortalized  cell  lines.  DNA  that  had  been  extracted  previously  using 
phenol/chloroform  protocols,  purified  by  other  investigators,  or  stored  for  an  extended  period  of  time  was 
“repurified”  using  the  DNeasy  Tissue  Kit  (Qiagen).  PCR  products  were  visualized  by  ethidium  bromide 
staining  after  electrophoresis  through  a  0.8%  agarose  gel  with  TAE  buffer.  Because  the  longer  PRS1  and 
PRS2  involve  amplifying  a  7  kb  fragment,  all  samples  were  pretested  to  determine  if  the  DNA  was  of 
sufficient  quality  for  robust  amplification  of  large  fragments.  For  this  purpose,  we  typically  used  primers 
md31m-F  and  md4-R  (Table  1),  which  amplify  a  7  kb  segment  of  NF1REP-M.  None  of  assays  described 
in  this  manuscript  amplified  the  NFIREP-like  sequences  on  chromosome  19  as  determined  by  lack  of 
amplification  of  cell  line  10449,  which  is  a  monochromosomal  human  chromosome  19  cell  line  (NIGMS 
Human  Genetic  Cell  Repository;  Coriell  Institute,  Camden,  NJ)(Figure  2B  and  data  not  shown). 
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Table  1.  Primers  for  analysis  of  NF1REP  paralogous  recoml 

Dination  sites. 

Segment 

location 

Han 

Primer 

Name 

Sequence  (5’  — >3’) 

PRS1 

NF1REP-P1 

md4-F 

126Kproxl-R 

ATCCTGTAGCCCAGTTCTCCTC 

AACACAAGTTGGGAGCCTTCTG 

PRS1 

NF1REP-M 

md31m-F 

md4-R 

CCCAGTTCCCTAAACCTCCAG 

ACAGTCTCGCTCTGATGCCCGT 

internal 

PRS1 

NF1REP-P1 

and 

NF1REP-M 

0.2 

md32-F 

md32-R 

AGGCTAGAGCAGAAGCTAACATCA 

GGTTTCAGTGAGCTGAGATCGTG 

PRS2 

NF1REP-P1 

7.5 

Mddjlb-F 

P148prx-R 

AACCT  CCC  AGGCTCCCGA  AT  A 
TGAACAGGCCCATTTCTTGTC 

PRS2 

NF1REP-M 

6 

4400s4-F 

P148med-R 

ACTGTGCCCAGCATCTTGGT 

TTCTGATACTGAACTGGCTTAGG 

internal 

PRS2 

NF1REP-P1 

and 

NF1REP-M 

0.15 

CGGAGCGATGACTCCATCAC 

TCCTTGCCTGGAAACCTTGC 

PRS1 

deletion 

junction 

Recombinant 

NF1REP-P1 

and-M 

3 

md4-F 

md30-R 

See  above 

CAGTGAACCGAGATTGTACTACCA 

7 

md4-F 

md4-R 

See  above. 

See  above 

PRS2 

deletion 

junction 

Recombinant 

NF1REP-P1 

and-M 

3.4 

md21-F 

deljunl-R1 

CAACCTCCCAGGCTCCCGAA 

AGCCCCGAGGGAATGAAAAGC 

3.4 

mddjlc-F 

deljunl-R1 

TCAACCTCCCAGGCTCCCGAA 

See  above. 

7 

md21-F 

md7-R 

See  above 

GGCAGGGTTAGGAGCTCTGGA  A 

'Reported  previously  as  primer  DTR  (Lopez-Correa,  2001  #939). 
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Table  2. 


NF1  microdeletion  cases  detected  by  PRS  deletion  junction  assays. 


Deletion  Junction 

Assay 

Shorter  amplicon 
(3-3.4  kb) 

Extended  amplicon 
(7  kb) 

Junction  fragment 
sequenced  (# 
cases) 

PRS1 

13 

14 

14 

PRS2 

33 

40 

29 

Total 

46/78 

54/78 

FIGURE  LEGENDS 


Figure  1.  Schematic  of  the  NF1  gene  region  at  chromosome  segment  17ql  1.2.  The  350  kb  NF1  gene  is 
shown  along  with  the  5 1  kb  flanking  paralogs  denoted  as  NF 1 REP-P 1  (previously  known  as  NF 1 REP-P; 
(30)  and  NF1REP-M,  which  are  in  direct  orientation.  Paralogous  recombination  between  these  NF1REP 
elements  results  in  a  recurrent  1.5  Mb  deletion  of  the  entire  NF1  locus  (30).  The  locations  of  the 
paralogous  recombination  sites,  PRS1  and  PRS2,  are  shown.  The  stippled  portion  of  PRS2  designates  the 
previously  reported  2  kb  recombination  hotspot  (20),  while  the  hatchmarks  designate  the  extended  portion 
of  the  site  reported  here.  The  basepair  coordinates  are  relative  to  BAC  271K1 1,  which  contains  NF1  REP- 
PI  (30). 

Figure  2.  NF1  deletion  junction-specific  assays  for  PRS1  and  PRS2.  A.  A  schematic  of  the  recombinant 
NF1REP  after  paralogous  recombination  in  PRS1  has  deleted  the  intervening  1 .5  Mb  and  the  NF1  gene. 
The  relative  positions  of  primers  in  NF1  REP-P  1  and  -M  are  shown.  The  left  side  of  the  gel  shows  the  3 
kb  amplicon  generated  by  the  primers  md4-F  and  md30-R  for  patients  UWA119-1,  UWA123-3  and 
UWA240-1.  The  right  side  of  the  gel  shows  the  7  kb  amplicon  generated  by  primers  md4-F  and  md4-R. 
This  amplifies  a  larger  segment  that  can  detect  deletion  breakpoints  of  additional  patients  such  as 
UWA241-1.  No  amplicons  are  generated  in  either  assay  for  cases  UWA128-3  and  UWA156-1,  which 
have  breakpoints  at  PRS2.  B.  The  gel  documents  a  PRS1  assay  result  that  can  be  mistakenly  interpreted 
as  positive.  True  positives  for  PRS1  junction  assays  of  genomic  DNA  are  shown  for  patients  UWA1 19-1 
and  the  somatic  cell  hybrid  carrying  the  deleted  chromosome  17  of  patient  UWA1 19-1  and  UWA123-3- 
H#97.  Genomic  DNA  sample  UWA183-1,  which  should  be  negative  in  this  assay  since  it  has  a 
breakpoint  at  PRS2,  gives  a  smear  of  products  and  lacks  the  intense  7  kb  junction  fragment.  Confirmation 
that  this  result  is  a  true  negative  is  given  by  lack  of  7  kb  junction  fragment  from  genomic  DNA  of  somatic 
cell  hybrid  line  (UWA183-1-H#15)  carrying  the  deleted  chromosome  17  of  patient  UWA183-1.  CH19  is 
genomic  DNA  from  monochromosomal  19  somatic  cell  hybrid  line,  which  confirms  the  NF1REP-E19 
does  not  interfere  with  the  assay.  C.  A  schematic  of  the  recombinant  NF1REP  after  paralogous 
recombination  at  PRS2  deleted  the  intervening  1 .5  Mb  with  the  relative  positions  of  primers  in  NF1REP- 
P1  and  -M  shown.  The  left  side  of  the  gel  shows  the  3.4  kb  amplicon  generated  by  primers  DCF  and 
deljunl-R  (also  known  as  DTR  in  reference  (20)  for  patients  UWA128-3  and  UWA156-1.  Since  our 
earlier  report  (20),  we  have  replaced  primer  DCF  with  md21-F,  which  has  one  additional  base  at  the  5’ 
end,  which  makes  the  reaction  more  reliable  and  robust  (data  not  shown).  The  right  side  of  the  gel  shows 
the  7.5  kb  amplicon  generated  by  primers  md21-F  and  md7-R.  This  assay  detects  deletion  breakpoints  of 
additional  patients,  such  as  UWA172-1  and  UWA176.  No  amplicons  are  generated  in  either  assay  for 
UWA1 19-1  and  UWA123-3,  who  have  breakpoints  at  PRS1 .  In  all  gels,  NEG  indicates  the  negative 
control  without  DNA  template  and  MW  denotes  the  1  kb  ladder  of  molecular  weight  standards  (Promega). 

Figure  3.  NF1  microdeletion  breakpoints  at  PRS1  and  PRS2.  The  top  of  each  figure  shows  selected  PSV 
of  NF1REP-P1  and  NF1REP-M,  with  the  nucleotide  coordinates  according  to  the  BAC  271K1 1.  All  PSV 
between  121,991  and  123,963  are  shown  based  on  the  sequences  of  the  BACs  and  7  NF1REP-P1  and 
NF1REP-M  alleles  of  control  individuals.  PS  Vs  for  the  region  123,963  to  126,402  are  based  on  BAC 
sequences  only.  Coordinates  designated  with  the  “p”  suffix  indicate  SNPs,  however,  not  all  SNPs  are  not 
shown.  The  black  bars  denote  the  interval  where  paralogous  recombination  occurred  between  NF1  REP- 
PI  and  NF1REP-M  to  generate  the  microdeletion  breakpoint.  Parental  origin  is  M,  maternal;  P,  paternal; 
F,  familial;  U,  unknown.  A.  NF1  microdeletion  breakpoints  are  shown  for  14  patients  at  PRS1 .  The 
dashed  line  indicates  the  551  bp  hotspot  where  64%  (9/14)  of  breakpoints  in  PRS1  map.  The 
recombination  interval  for  case  UWA123-3  is  detailed  below.  B.  Details  of  PSVs  in  the  breakpoint 
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region  of  patient  UWA123-3  showing  a  pattern  consistent  with  gene  conversion.  The  breakpoint  interval 
of  this  patient  was  defined  by  sequencing  the  paternal  nonrecombinant  NF1REP-P1  and  NF1REP-M  that 
served  as  recombination  substrates  during  generation  of  the  patient’s  de  novo  deletion.  Three  SNPs 
(coordinates  with  “p”  suffix)  were  informative  in  defining  the  interval  and  demonstrating  gene 
conversion.  C.  NF1  microdeletion  breakpoints  are  shown  for  29  patients  at  PRS2.  *Indicates  previously 
reported  breakpoint  data  for  25  patients  (20),  which  were  modified  in  this  study  by  identification  of 
polymorphisms  from  normal  alleles  (see  Materials  and  Methods).  The  precise  breakpoint  intervals  and 
apparent  gene  conversion  events  in  cases  984412  and  973287  (among  the  14  patient  group)  have  been 
detailed  previously  (20).  All  PSV  between  140,852  and  144,507  are  shown;  however,  there  are  an 
additional  36  PSV  between  144,507  and  147,167  that  are  not  indicated.  The  dashed  line  indicates  the 
2,292  bp  hotspot  interval  where  86%  of  breakpoints  in  PRS1  map. 
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ABSTRACT 


Recombination  between  paralogs  that  flank  the  NF1  gene  at  17ql  1.2  typically  results  in  a  1.5  Mb 
microdeletion  that  includes  NF1  and  at  least  14  contiguous  genes.  The  principal  sequences 
responsible  are  two  51-kb  blocks  with  97.5%  sequence  identity  (NF1REP-P1-51  and  NF1REP- 
M-51).  Here  we  show  that  these  paralogs  belong  to  a  complex  group  of  paralogs  from  different 
sequence  families  with  three  components  on  17q  and  another  at  19pl3.  Multiple  breakpoint 
sequencing  reveals  two  paralogous  recombination  hotspots  within  the  51-kb  blocks.  We  seek  to 
explain  the  location  of  the  hotspots  based  on  sequence  motifs,  quality  of  alignment  match,  and 
nucleotide  sequence  identity.  Hotspot  locations  with  respect  to  relatively  large  paralogous 

o 

alignment  gaps  suggest  that  the  gaps  may  a  primary  factor  localizing  pairing  and  exchange. 

Gene  conversion  tests  reveal  a  700  bp  gene  conversion  tract  at  the  predominant  paralogous 
recombination  hotspot.  Thus,  high  sequence  identity  at  a  hotspot  may  be  a  result,  as  well  as  a 
potential  cause,  of  propensity  to  pair  at  a  particular  site.  Whether  alignment  gaps  are  a  key 
breakpoint  localization  factor  may  be  eventually  assessed  by  discovery  and  comparison  of  more 
systems  that — like  the  NFIREPs — show  relatively  high  levels  of  alignment  mismatch  between 
paralogs. 
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INTRODUCTION 


In  the  past  five  years,  it  has  become  apparent  that  recombination  between  pairs  of  non-allelic, 
high-identity,  low-copy  number  DNA  repeat  sequences  is  an  important  mechanism  for 
chromosomal  rearrangements  that  cause  human  genetic  disease  (reviewed  by  Emanuel  and 
Shaikh  2001;  Stankiewicz  and  Lupski  2002a).  These  repeated  DNA  elements  have  been  referred 
to  as  REP  (repeats;  Lupski  et  al.  1991),  LCR  (low  copy  repeats;  Shaikh  et  al.  2000),  duplicons, 
or  paralogs.  Recombination  between  misaligned  paralogs  (unequal  crossing  over)  on  sister  or 
non-sister  chromatids,  between  paralogs  on  the  same  chromatid  (loop  out-excision),  or  between 
paralogs  on  different  chromosomes  can  result  in  deletion,  duplication,  inversion,  or  translocation 
(reviewed  by  Inoue  and  Lupski  2002).  Recombination  between  paralogs  has  been  referred  to  as 
“nonallelic  homologous  recombination”  (Stankiewicz  and  Lupski  2002b),  “ectopic 
recombination,”  (Kuroda-Kawaguchi  et  al.  2001)  or  “paralogous  recombination”  (Stephens 
2003)  to  distinguish  it  from  classical,  allelic  homologous  recombination.  Investigation  into  the 
molecular  basis  of  paralogous  recombination  diseases  is  in  its  infancy,  as  few  breakpoints  have 
been  mapped  at  the  sequence  level  and  few  REPs  have  undergone  sequence  and  structural 
analyses. 

Paralogous  recombination  is  the  dominant  mechanism  for  submicroscopiC  contiguous 
gene  deletions  that  encompass  the  NF1  gene  at  chromosome  17ql  1.2.  The  consequent 
.haploinsufficiency  for  the  NF1  protein  product  causes  neurofibromatosis  1  (NF1),  an  autosomal 
dominant  multi-systemic  disorder  that  affects  ~l/3500  individuals  worldwide.  NF1  is  primarily 
characterized  by  multiple  benign  nerve  sheath  tumors  or  neurofibromas  (reviewed  by  Korf 
2002).  An  estimated  5-10%  of  cases  are  due  to  submicroscopic  NF1  microdeletion  (Cnossen  et 
al.  1997;  Rasmussen  et  al.  1998),  while  most  remaining  cases  result  from  premature  translation 


termination  or  splicing  defects  (Messiaen  et  al.  2000).  NF1  microdeletion  carriers  are  of  great 
interest  because  they  are  predisposed  to  developing  early-onset  and  excessive  numbers  of 
neurofibromas  (Kayes  et  al.  1994;  Leppig  et  al.  1997;  Dorschner  et  al.  2000)  and  are  at  increased 
risk  of  developing  malignant  peripheral  nerve  sheath  tumors  (De  Raedt  et  al.  2003;  Kluwe  et  al. 
2003).  Whether  they  are  at  increased  risk  for  other  manifestations,  related  or  unrelated  to  NF1, 
awaits  a  comprehensive  clinical  evaluation  of  a  cohort  of  NF1  microdeletion  subjects. 

STS  mapping  of  17  independent  NF1  microdeletions  revealed  that  82%  had  proximal  and 
distal  breakpoints  clustered  in  two  low  copy  repeat  elements,  designated  NFIREPs,  that  flanked 
the  NF1  gene  (Dorschner  et  al.  2000)  (Figure  1).  The  direct  orientation  of  the  NFIREPs 
suggests  a  mechanism  whereby  NF1REP-P1  and  NF1REP-M  misalign  during  meiosis  with 
subsequent  unequal  crossing  over  between  chromatids,  or  loop-out  excision  between  the  repeats 
on  a  single  chromatid,  resulting  in  a  recurrent  1.5  Mb  NF1  microdeletion  (Dorschner  et  al.  2000) 
(Figure  1).  About  80%  of  de  novo  NF1  microdeletions  occur  on  maternally-derived 
chromosomes  (Upadhyaya  et  al.  1998;  Lopez-Correa  et  al.  2001;  Dorschner  et  al.  submitted). 

This  is  consistent  with  haplotype  analyses  of  healthy  ancestors  of  de  novo  NF1  microdeletion 
subjects  that  suggested  paralogous  recombination  occurred  during  maternal  meiosis  II  (Lopez- 
Correa  et  al.  2000). 

The  NF1  microdeletion  breakpoints  were  mapped  to  intervals  defined  by  paralogous 
sequence  variants  (PSV;  Estivill  et  al.  2002)  that  were  specific  to  an  individual  NF1REP  paralog. 
The  pattern  of  PS  Vs  from  amplicons  of  the  deleted  chromosomes  17  were  expected  to  match  that 
of  either  NF 1 REP-P 1  or  NF 1 REP-M,  or  transition  from  NF 1 REP-P 1  to  NF 1 REP-M  if  the 
amplicon  contained  the  breakpoint  of  the  recombinant  NF1REP.  This  procedure  identified  a  2 
kb  recombination  hotspot  that  accounted  for  46%  (N=54)  of  cases  with  NF1  contiguous  gene 
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deletions  (Lopez-Correa  et  al.  1999).  Recent  breakpoint  mapping  of  additional  cases  has 
extended  this  breakpoint  cluster  region,  herein  designated  PRS2  (paralogous  recombination  site 
2)  to  a  4,155  bp  interval  and  identified  an  additional,  independent  breakpoint  cluster  region 
(designated  PRS1)  of  6,323  bp. 

Together,  breakpoints  at  PRS1  and  PRS2  accounted  for  69%  of  NF1  whole  gene 
deletions  (Lopez-Correa  et  al.  2001;  Dorschner  et  al.  submitted).  In  all  cases,  the  sequence  of  the 
breakpoint  intervals  was  consistent  with  a  perfect  homologous  exchange;  there  was  no  evidence 
of  small  deletions  commonly  associated  with  non-homologous  end  joining  (Valerie  and  Povirk 
2003).  Three  of  these  cases  (one  at  PRS1  and  two  at  PRS2)  showed  evidence  of  gene 
conversion,  with  alternate  NF1REP-P1  and  NF1REP-M  PS  Vs  in  the  exchange  region  (Lopez- 
Correa  et  al.  2001;  Dorschner  et  al.  submitted).  Apparent  gene  conversion  tracts  are  consistent 
with  a  mechanism  similar  to  the  double  strand  break  (DSB)  repair  model  in  yeast  (Haber  2000; 
Johnson  and  Jasin  2001).  Under  this  model,  the  breakpoint  hotspots  at  PRS1  and  PRS2  imply 
the  presence  of  a  sequence  motif,  functional  domain,  or  higher  order  chromosomal  feature  that 
predisposes  to  DSB.  The  primary  sequences  of  the  short  PRS1  and  PRS2  recombination 
substrates  revealed  no  apparent  basis  for  regional  DSBs  (Dorschner  et  al.  2000;  Lopez-Correa  et 
al.  2001).  However,  knowledge  about  number,  orientation,  and  structure  of  NFIREPs  in  the 
genome  in  which  PRS1  and  PRS2  were  embedded  was  lacking.  Here  we  detail  the  primary 
structure  of  four  NFIREPs,  including  the  larger  sequence  context  of  the  PRS1  and  PRS2.  We 
search  for  sequence  features  at  breakpoint  clusters  that  may  explain  their  location  and  discuss 
evidence  for  gene  conversion  between  the  NF 1  REP  paralogs. 


RESULTS 


NFIREPs  Are  Complex  Assemblies  of  Paralogs  from  Different  Sequence  Families 
The  paralogs  NF1REP-P1,  NF1REP-P2,  and  NF1REP-M  flanking  the  NF1  gene  at  17ql  1.2 
(Figure  1  A)  are  represented  by  finished  BACs  in  the  April  2003  assembly  of  the  public  Human 
Genome  Project  (National  Center  for  Biotechnology  Information  (NCBI)  build  32).  This  is  the 
first  NCBI  assembly  in  which  NF1REP  ordering  and  orientation  are  consistent  with  all  evidence 
previously  reported,  including  public  Human  Genome  Project  BAC  sequences,  STS  mapping, 
sequencing  of  breakpoint  regions  in  NF1  de  novo  microdeletion  patients  and  normal  controls, 
mANFl  microdeletion-specific  assays  (Dorschner  et  al.  2000;  Jenne  et  al.  2001;  Lopez-Correa 
et  al.  2001;  Dorschner  et  al.  submitted).  This  public  assembly  and  the  BAC  contig  of  Jenne  et  al. 
(Jenne  et  al.  2003)  are  based  largely  on  different  BACs,  but  both  nevertheless  show  a  finished 
tiling  path  across  1.5  Mb  from  NF1REP-P1  through  NFREP-M, 

Sequence  and  structural  analysis  showed  that  the  NF1REP-P1,  NF1REP-P2,  and 
NF1REP-M  paralogs  vary  in  length  and  composition,  but  are  generally  comprised  of  gene  and 
gene-derived  fragments  and  larger  sequence  blocks  apparently  lacking  genes  (Fig.  IB). 
NF1REP-P1  is  131  kb  in  length,  is  contained  largely  on  BAC  RP1 1-271K1 1  (nt  75,477- 
199,182),  and  extends  6,944  bp  into  the  overlapping  BAC  CTD-2349P21.  NF1REP-M  is  75  kb 
in  length  and  is  fully  contained  in  BAC  RP1 1-640N20  (nt  102147-177525).  NF1REP-P2  is  43 
kb  and  folly  contained  in  BAC  RP1 1-229K15  (nt  43619-86245). 

The  apparently  functional  copy  of  the  gene  KIAA0563rel  is  contained  in  NF1REP-M 
(Jenne  et  al.  2003).  It  is  supported  by  a  foil-length  mRNA  (J KIAA0563-related  gene;  Genbank 
NM_052888)  and  a  substantial  Unigene  cluster  (Hs.367593).  This  gene  is  also  known  by  an 
STS  marker  (WI-12393)  contained  within  it  (Jenne  et  al.  2000;  Jenne  et  al.  2001).  Fragments 


derived  from  this  gene  are  present  in  NF1REP-P1,  the  distal  region  of  NF1REP-M,  and  in 
NF1REP-P2  (green  blocks,  Figure  IB).  NF1REP-M  carries  the  12  exons  and  introns  of  the 
functional  KIAA0563rel,  while  NF1REP-P1  has  an  inverted  segment  containing  intron  7,  exon  8, 
and  part  of  intron  8  followed  by  exons  and  introns  9-12.  Additional  KIAA0563rel  fragments  are 
scattered  throughout  distal  17q  and  on  several  other  chromosomes.  A  33,692  bp  paralogous 
sequence  block  (BAC  RP1 1-271K1 1;  nt  1 12,506-146,198)  is  shared  by  NF1REP-P1  and 
NF1REP-M  and  apparently  lacks  genes  or  pseudogene  sequences  (Figure  IB,  dark  blue) 

The  largest  sequence  blocks  shared  by  NF1REP-P1  and  NF1REP-M  are  51  kb  in  length 
and  show  97.5  percent  sequence  identity  (analysis  of  manually  aligned  sequences  using  the 
DnaSP  program;  Rozas  and  Rozas  1999).  Here  we  refer  to  these  blocks  (represented  by  BAC 
RP1 1-271K1 1;  nt  96,901-147,876)  as  NF1REP-P1-51  and  NF1REP-M-51,  or  collectively  as 
NF1  REP-51.  These  repeats  are  of  particular,  interest  because  they  contain  PRS1  and  PRS2,  the 
sites  where  an  estimated  69%  NF1  microdeletion  breakpoints  occur  (Figure  IB,  red 
blocks)(Lopez-Correa  etal.  2001;  Dorschneretal.  submitted). 

Two  additional  pseudogenes  comprise  the  flanking  regions  ofNFIREP-Pl.  SMURF2 
pseudogene  fragments  (Figure  IB,  yellow  blocks)  are  present  on  NF1REP-P1  (RP1 1-271K1 1;  nt 
75,477-91,072)  and  on  NF1REP-P2  (RP1 1-229K15;  nt  43,619-64,829).  The  functional  copy  of 
the  SMURF2  gene  encodes  E3  ubiquitin  ligase  (NM_022739),  located  at  17q24  and  not 
associated  with  any  other  NFIREP-related  elements.  Distal  to  the  NF1REP-P1-51  is  a  segment 
with  no  apparent  genes  (RP1 D271K1 1,  nt  147,872-184,526),  and  a  LEC2  pseudogene  fragment 
beginning  at  RP1 1-271  Kll  nt  184,527  and  extending  into  CTD-2349P21  (Figure  1C,  light  blue 
and  purple,  respectively).  Parts  of  NF1REP-P1-51  and  these  distal  blocks  have  paralogs  at 
chromosome  19pl3,  which  we  designated  NF1REP-E19  (Figure  1C).  This  REP  contains  the 
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functional  LEC2  gene  and  resides  on  the  three  BACs  indicated.  It  spans  151  kb,  including  33  kb 
of  dispersed  blocks  matching  NFIREP-Pland  -M  and  containing  the  PRS1  and  PRS2  breakpoint 
clusters  (Fig.  IB;  dark  blue  blocks).  NF1REP-E19  matches  both  NFIREP-Pland  -M  at  only  94- 
95%  sequence  identity.  It  contains  no  sequences  matching  KlAA0563rel,  and  KIAA0563rel 
copies  are  not  associated  with  any  SMURF2  or  LEC2  copies  elsewhere. 

Sequence  Composition  Of  Breakpoint  Clusters  and  Hotspots 

The  positions  of  breakpoint  intervals  with  respect  to  sequence  features  of  the  NFIREPs  are 
detailed  in  Figure  2.  The  breakpoint  intervals  that  defined  PRS1  and  PRS2  were  reported 
previously  (Lopez-Correa  et  al.  200 1 ;  Dorschner  et  al.  submitted).  Within  each  PRS  a  majority 
of  breakpoints  is  concentrated  in  a  smaller  hotspot.  Within  PRS1  an  interval  of  551  bp  accounts 
for  60%  of  the  breakpoints  (9/1 5),  and  within  PRS2  an  interval  of  2292  bp  accounts  for  93%  of 
the  breakpoints  (27/29)(Fig.  2). 

Here  we  report  an  additional,  unique  NF1  microdeletion  case  with  breakpoints  mapping 
within  the  NF 1  REP-5 1  repeats,  but  outside  of  either  PRS.  The  breakpoint  interval  of  patient 
UWA 160-1  mapped  to  the  interval  located  about  20  kb  proximal  to  PRS1  (nt  102,102-102,186  in 
BAC  RP1 1-271K1 1;  Figure  2).  Sequencing  the  breakpoint  region  of  the  deleted  chromosome 
17,  which  had  been  segregated  into  a  human-rodent  somatic  cell  hybrid  line,  showed  a  single 
transition  from  an  NF1REP-P1 -specific  PSV  to  an  NFIREP-M-specific  PSV.  This  case 
demonstrates  that  paralogous  recombination  can  occur  between  the  NF1  REP-51  repeats  at  sites 
other  than  PRS1  and  PRS2,  albeit  at  a  significantly  reduced  frequency. 

If  recombination  hotspots  are  due  to  obligate  local  sequence  features,  we  might  expect 
the  PRS  clusters  to  show  some  similarity  of  sequence  composition,  but  sequence  and  structural 
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analysis  of  PRS  1  and  PRS2  revealed  no  commonalities.  BLAST  comparison  of  PRS1  and  PRS2 
showed  no  significant  sequence  identity,  with  the  exception  of  Alu  elements,  LINES,  SINES, 
and  other  high  copy  repeats,  which  typically  shared  <80%  identity  and  lengths  of  <500  bp. 
Furthermore,  PRS1  and  PRS2  showed  entirely  different  patterns  of  high-copy  repeats  and  G+C 
content  (Fig.  2).  PRS1  has  high-copy  repeats  spread  throughout  and  contains  67%  repeats 
overall,  while  PRS2  is  only  44%  repeats  and  the  PRS2  hotspot  is  devoid  of  high-copy  repeats. 
The  entire  NF1REP-51  repeat  is  composed  of  55%  high-copy  repeats,  and  the  entire  BAC 
271K1 1  is  51%  high-copy  repeats.  Thus,  one  PRS  region  has  more  repeats  than  average  for  the 
region,  and  the  other  has  fewer.  The  PRS  1  and  PRS2  clusters  both  have  higher-than-average 
G+C  for  their  contexts  (53.0%  and  55.7%  respectively,  versus  48.4%  for  the  entire  51  kb  high- 
identity  repeat).  However,  the  PRS2  hotspot  is  especially  G+C-rich  (63.7%),  while  the  PRS1 
hotspot  is  not  (52.7%).  These  sequence  composition  values  are  calculated  for  the  NF1REP-P1- 
51,  but  due  to  the  97.5%  identity,  these  values  did  not  differ  markedly  from  those  for  the 
paralogousNFlREP-M-51  repeat. 

NF1REP  Sequence  Alignment,  Sequence  Identity,  and  Breakpoint  Localizations 
The  NF1  REP-51  paralogs  that  harbor  the  breakpoints  in  69%  of  the  NF1  whole  gene  deletions 
differ  more  than  do  typical,  allelic  recombination  substrates.  Mean  sequence  similarity  is  97.5% 
considering  both  indels  and  PSVs.  Of  the  2.5%  overall  divergence,  1.4%  is  represented  by  PSVs 
and  1.1%  by  indels  (analysis  of  manually  aligned  sequences  using  the  DnaSP  program;  Rozas 
and  Rozas  1999). 

Segments  of  the  aligned  NF1  REP-51  differ  in  alignment  quality.  The  top  panel  of  Figure 
3  shows  the  size  of  all  alignment  gaps  between  the  NF1REP-P1-51  and  NF1REP-M-51  repeats. 
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PRS1  and  PRS2  contain  sparse  misalignments  of  1  to  4  bp.  Both  PRS  are  devoid  of  large  gaps, 
although  both  are  flanked  proximally  by  markedly  larger  gaps.  The  36-bp  alignment  gap  at  the 
front  of  PRS  1  is  due  to  a  (GA)n  microsatellite  that  is  fixed  at  nine  repeat  units  in  NF1REP-P1 
and  ranges  from  15  to  31  units  in  NF1REP-M.  PRS2  is  bordered  by  a  unique  24  bp  indel  that  is 
a  fixed  difference  among  the  three  NF1REP-P1  and  two  NF1REP-M  sequences  currently 
available  in  Genbank  (accession  numbers  in  Methods;  our  own  PRS2  sequences  in  patients  and 
normals  did  not  extend  proximally  to  this  site).  These  relatively  large  alignment  gaps  flanking 
the  PRS  apparently  do  not  prevent  pairing  for  recombination,  as  exchange  can  occur  within 
distances  of  less  than  1  kb  from  the  gaps.  However,  the  principle  hotspots  are  located  1  to  2  kb 
distal  to  these  alignment  gaps  (Figure  2),  where  paralogous  misalignments  are  limited  to  1  to  4 
bp  both  in  available  Genbank  entries  and  in  our  hotspot  sequences  from  patients  and  normals 
(see  Methods). 

On  a  kilobase  scale,  the  PRS  are  not  regions  of  greater  or  lesser  paralogous  sequence 
identity,  as  shown  by  the  spatial  distribution  of  PSVs  in  the  middle  panel  of  Figure  3.  The 
overall  percent  divergence  for  NF1 REP-P 1-51  andNFlREP-M-51  is  1.36%  (indels  excluded), 
and  the  corresponding  values  for  PRS1  (1.17%)  and  PRS2  (1.45%)  are  not  significantly  different 
from  the  overall  mean  (2x2  contingency  CW-square  tests;  p>0.1).  Nucleotide  site  differences 
(PSVs)  appear  uniformly  distributed  across  the  entire  NF1REP-51,  with  two  notable  exceptions. 
The  high  divergence  tract  at  98.4  kb  contains  a  matrix  attachment  region  (MAR;  see  below). 

The  other  high  divergence  tract  at  1 1 8  kb  is  a  short  (95  bp)  segment  with  1 5  variable  sites  ( 1 3 
PSVs  and  2  indels  of  1  bp  each)  where  NF1REP-P1-51  matches  a  segment  of  NF1REP-E19. 
These  variable  sites  are  fixed  in  the  three  sequenced  alleles  of  NF1  REP-P  1-51  available  in 
Genbank  (accession  numbers  in  Methods),  and  they  likely  represent  an  early  gene  conversion 
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event  that  followed  duplication  of  the  NF1  REP-51  paralogs.  The  NF1REP-E19  blocks  matching 
the  NF1REP-P1-51  (Figure  1C)  show  only  94-95%  identity  between  these  inter-chromosomal 
paralogs.  Gene  conversion  tests  and  searches  for  other  NFIREP-specific  variants  shared 
between  NF1REP-E19  and  its  paralogs  on  chromosome  17  show  no  other  evidence  of  historical 
gene  conversion  or  recombination  (data  not  shown). 

Recombinogenic  Structures  and  Domains 

We  searched  the  NF1REP  breakpoint  clusters  for  structures  reported  to  be  associated  with 
recombination  in  other  settings.  In  addition,  we  looked  for  commonalities  between  the 
breakpoint  clusters  of  NF1  and  those  reported  in  Charcot-Marie-Tooth  disease  type  1A 
(CMT1  A),  another  paralogous  recombination  disorder  in  which  breakpoints  have  been  mapped 
in  similar  detail. 

Several  genomic  rearrangements  have  been  associated  with  palindromes  including  the 
large,  recurrent  deletions  in  the  Y-chromosome  AZFc  region  that  lead  to  male  infertility 
(Repping  et  al.  2002)  and  translocations  by  AT-rich  palindromic  sequences  involving  t(l  1;22) 
(Kurahashi  and  Emanuel  2001)  and  t(17;22)  (Kurahashi  et  al.  2003),  the  latter  of  which  occurs  in 
NF1  intron  31.  Furthermore,  studies  in  yeast  demonstrate  that  palindromes  longer  than  50  bp 
can  be  recombination  hotspots  (Nag  and  Kurst  1997;  Nasar  et  al.  2000).  The  NF1REP-51 
repeats  contain  no  palindromes  larger  than  18  bp  and  separated  by  63  bp  when  the  arms  are 
allowed  to  be  100  bp  and  separated  by  up  to  100  bp.  Yet,  these  default  settings  of  the 
Palindrome  program  readily  detected  the  AT-rich  palindrome  in  NF 1  intron  3 1 .  More 
permissive  pattern  searches  for  large  inverted  repeats  performed  on  the  entire  NF1REP-P1  and 
NF1REP-M  revealed  a  single  significant  pair  of  sequences  that  straddles  the  proximal  boundary 


of  the  NF1REP-P1-51  repeat  (Fig.  IB;  light  green  blocks).  These  non-tandem  inverted  repeats 
are  5,778  bp  long,  are  separated  by  3,708  bp  of  unrelated  sequence,  show  98.6%  sequence 
identity,  and  contain  copies  of  KIAA0563rel  exon  8.  The  “inverted”  segment  of  the  pair  is 
present  only  on  NF1REP-P1  (Figure  IB).  Pairing  between  these  repeats  of  NF1REP-P1  is 
unlikely  to  account  for  preferential  DSB  that  initiate  paralogous  recombination  at  PRS1  and 

i 

PRS2,  which  are  15  kb  and  36  kb  telomeric  to  the  distal  copy  of  the  repeat. 

A  role  for  specific  sequence  signals  in  mammalian  recombination  is  not  established,  but 
tests  for  the  presence  of  motifs  with  demonstrated  or  suspected  roles  in  recombination, 
transcription,  or  replication  are  still  warranted.  In  E.  coli,  recombination  is  stimulated  by  the  8- 
bp  Chi  sequence  [5?-GCTGGTGG-3’].  BAC  271K1 1  contains  20  Chi-like  sites  in  a  span  of  200 
kb.  Within  this  span,  the  NF1REP-P1-51  kb  block  contains  5  Chi-like  sites,  one  of  which  falls 
within  the  PRS2  hotspot  (Lopez-Correa  et  al.  2001),  but  the  other  four  are  evenly  distributed 
without  apparent  association  with  breakpoints.  NF1REP-M-51  has  Chi- like  sequences  at  the 
same  five  sites.  Due  to  their  association  with  recombination  in  yeast  (Gale  et  al.  1992;  Badge  et 
al.  2000),  we  also  searched  for  autonomously  replicating  sequences  (ARS).  BAC  271K1 1 
contains  13  S.  cerevisiae  ARS  sites.  The  NF1  REP-51  both  contain  the  same  five  S,  cerevisiae 
ARS  sites,  one  of  which  is  within  PRS1  but  2  kb  distal  to  the  PRS1  hotspot.  This  does  not 
indicate  a  marked  excess  of  ARS  sites  in  the  5 1  kb  repeat  or  in  the  PRS,  and  the  proximity  of  one 
ARS  site  to  the  PRS  1  hotspot  could  be  happenstance.  Neither  were  the  other  putative 
recombinogenic  motifs  (see  Methods)  found  to  be  preferentially  associated  with  PRS1  or  PRS2 
(Dorschner  et  al.  2000;  Lopez-Correa  et  al.  200 1 ;  Dorschner  et  al.  submitted). 

In  yeast,  recombination  is  locally  stimulated  by  transcription  (Freedman  and  Jinks- 
Robertson  2002;  Gonzalez-Barrera  et  al.  2002).  There  is  one  known,  expressed  gene 
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( KIAA0563rel )  in  NF1REP-M  (Fig.  IB)  and  5  of  its  exons  are  in  the  NF1REP-M-5 1  repeat, 
indicating  that  this  is  at  least  at  times  functionally  accessible  chromatin.  NF1REP-P1  is  missing 
the  promoter  region  of  this  gene.  However,  a  Unigene  cluster  (Hs.306516)  does  map  to  proximal 
NF1REP-P1.  This  transcription  unit  is  distinct  from  that  matching  KIAA0563rel  (Hs.367593), 
suggesting  that  both  regions  are  transcribed,  although  an  examination  for  ORFs  did  not  indicate  a 
functional  protein  product  from  the  proximal  site.  A  search  for  functional  chromatin  based  on 
promoter  sequences  (First  Exon  Finder;  see  Methods)  found  two  overlapping  promoter/first 
exons  on  opposite  strands  within  the  PRS2  hotspot  and  one  proximal  to  PRS1  (Figures  4  and  5). 
PRS 1  and  PRS2  are  not  associated  with  any  known  genes,  but  the  possibility  that  promoter-like 
sequences  render  chromatin  accessible  even  when  not  associated  with  functional  genes  must  be 
considered. 

Singh  et  al.  (Singh  et  al.  1997)  established  six  decision  rules  for  identifying  DNA  sites 
associated  with  chromatin  function:  origin  of  replication  (ORI)  signals,  TG-richness,  curved 
DNA,  kinked  DNA,  topoisomerase  II  binding,  and  AT  richness.  Together  these  rules  define  the 
MAR,  or  matrix  attachment  region.  An  800  bp  long,  high-scoring  MAR  is  centered  1500  bp  into 
the  NF1  REP-51  repeats  (Fig.  3),  and  the  NF1REP-P1  and  NF1REP-M  paralogs  of  this  tract  are 
each  the  highest-scoring  MAR  in  their  respective  BAC  context.  Both  score  most  strongly  for  die 
ORI  rale,  the  topoisomerase  II  binding  rale,  and  the  AT-richness  rale,  and  they  are  very  low-GC 
(37.6%).  The  paralogs  differ  in  this  region  by  the  insertion  of  several  AT-rich  tracts  in  NF1REP- 
M-51,  and  by  numerous  apparent  PSVs  in  regions  of  complex  AT-rich  sequence  that  are  difficult 
to  align.  Together,  these  findings  suggest  a  site  involved  in  chromatin  function  located  within 
KIAA0563rel,  the  only  known  expressed,  apparently  functional  gene  in  the  NF1  REP-51  repeats, 
and  the  openness  of  this  chromatin  may  enhance  the  likelihood  of  paralogous  pairing.  Promoter- 


like  sequences  near  the  PRS1  and  PRS2  hotspots  (Figs,  3  and  4)  may  likewise  enhance 
accessibility  for  pairing  in  these  regions. 

Gene  Conversion  Tests  in  the  NFIREPs 

In  the  absence  of  gene  conversion  or  recombination,  paralogous  sequences  should  have  evolved 
independently  and  sequence  differences  should  be  distributed  at  random.  Larger  stretches  of 
perfect  match  than  expected  at  random  are  evidence  in  support  of  gene  conversion.  The  longest 
fragment  of  perfect  match  between  the  finished  B  AC  sequences  for  NF 1 REP-P 1  and  NF 1 REP-M 
is  a  700-bp  tract  located  in  the  PRS2  hotspot  (nt  143,268-143,967;  Figs.  2,  3  and  4).  A  statistical 
test  (GeneConv;  Sawyer  1999)  on  the  NF1REP-51  repeats  from  the  reference  BACs  271K1 1  and 
640N20  revealed  this  to  be  the  single  significant  fragment  (p  =  0.008;  indels  included  as 
polymorphisms).  Among  the  five  Genbank  NF1REP  sequences  (three  proximal  and  two  medial; 
accession  numbers  in  Methods)  a  perfect  tract  at  this  location  was  the  largest  in  each  of  the  six 
paralogous  comparisons,  ranging  in  size  from  627  to  700  bp  due  to  polymorphisms.  In  each  case 
when  the  tract  was  shorter  (e.g.  627  bp  using  proximal  BAC  271K1  land  medial  BAC  2646J6), 
the  p-  value  of  the  test  was  higher  but  still  significant  (p  =  0.019;  indels  included  as 
polymorphisms). 

We  also  searched  the  alignment  of  the  five  copies  of  NFIREPs  for  shared 
polymorphisms.  Two  sites,  both  on  the  proximal  border  of  the  700-bp  putative  gene  conversion 
tract,  showed  the  same  two  variants  in  both  paralogs  (a  3 -bp,  +/-[AGC]  indel,  and  a  G/A  SNP;  nt 
142,308  and  143,314  respectively;  Fig.  4).  Such  polymorphisms  in  both  paralogs — especially 
the  identical  3-bp  indel — suggest  a  gene  conversion  event  subsequent  to  duplication  of  the 
region.  , 
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DISCUSSION 

Disease-causing  genome  rearrangements  due  to  recombination  between  paralogs  range  in  size 
from  less  than  10  kb  to  as  large  as  four  megabases,  and  the  size  of  the  REPs  involved  ranges 
from  a  few  kb  to  more  than  500  kb  (reviewed  by  Inoue  and  Lupski  2002;  Stankiewicz  and 
Lupski  2002a).  The  NFIREPs  that  mediate  NF1  microdeletion  have  a  modular  sequence 
structure,  scattered  pseudogene  fragments,  a  single  expressed,  functional  gene  and  a  related  set 
of  paralogs  on  a  different  chromosome.  The  NF  IREPs  are  more  complex  than  CMT1A  REPs 
(Reiter  et  al.  1996;  Lopes  et  al.  1999),  which  are  simple  related  sequence  blocks  in  direct 
orientation.  However,  they  are  not  so  complex  as  several  other  examples  such  as  the  REPs  that 
mediate  deletion  in  Smith-Magenis  and  Williams-Beuren  Syndromes  (Park  et  al.  2002)  (Bayes  et 
al.  2003),  which  contain  multiple  clusters  of  larger  blocks  in  different  combinations  and  different 
orientations.  In  all  of  these  examples  the  REPs  contain,  or  are  contained  within,  transcription 
units,  and  often  the  derived  paralog  contains  a  pseudogene  or  pseudogene  ffagment(s)  of  the 
original  gene. 

Hotspots  and  Sequence  Motifs 

The  predominant  mechanism  of  NF1  microdeletion  is  pairing  and  recombination  between 
NFlREP-Pl-51  and  NF1REP-M-51,  with  all  but  one  of  the  exchange  events  occurring  in  one  of 
two  clusters  (PRS1  and  PRS2).  These  clusters  represent  only  19%  of  the  alignable  51  kb 
sequence  apparently  available  for  pairing  and  exchange.  Breakpoint  mapping  in  the 
CMT1A/HNPP  duplication/deletion  disorders  also  identified  two  recombination  hotpots  (Lopes 
et  al.  1998;  Reiter  et  al.  1998)  and  additional  comparisons  to  the  NFIREPs  are  instructive.  In 


neither  NF1  nor  CMT1 A  were  there  common,  sequence-level  features  that  would  suggest  why 
the  breakpoint  clusters  occur  where  they  do.  We  found  no  actual  sequence  identity  between 
paralogous  recombination  hotspots  either  within  or  between  these  two  disorders,  and  no 
similarity  in  sequence  features  or  composition.  NF1REP  PRS2  contains  a  CTn-like  site  and 
promoter-like  elements,  but  PRS1  contains  neither.  Likewise,  the  two  CMT1 A  paralogous 
recombination  hotspots  are  entirely  different.  The  major  hotspot  (75%  of  cases)  is  flanked  by  a 
Mariner-like  transposon  element,  a  Chi-like  site,  and  a  minisatellite,  but  the  secondary  hotspot 
(20%  of  cases)  is  associated  with  no  such  motifs  (Reiter  et  al.  1998).  At  only  24  kb  (Reiter  et  al. 
1996),  the  CMT1 A  REPs  are  smaller  than  the  NF1REP-51  and  the  modular  structure  of  the 
CMT1 A  REPs  is  simpler  (Inoue  et  al.  2001).  The  CMT1 A  REPs  also  have  lower  percent  G+C 
(41%  vs.  48%  for  NF1  REP-51),  fewer  high-copy  repeats  (31%  vs.  55%  for  NF1REP-51),  no 
strong  MAR  sites,  and  tighter  breakpoint  clustering.  From  the  evidence  of  these  two  paralogous 
recombination  disorders,  there  is  no  support  for  the  existence  of  obligate,  consensus 
recombinogenic  motifs  or  features. 

There  are  to  date  many  more  examples  of  breakpoints  mapped  at  the  sequence-level  for 
allelic  recombination  events  (reviewed  by  de  Massy  2003)  than  for  paralogous  recombination 
events.  In  both  homologous  and  paralogous  recombination,  however,  multiple  examples  suggest 
that  putative  recombinogenic  motifs  may  be  influential  but  they  are  not  obligatory.  Several 
examples  in  yeast  (Nag  and  Kurst  1997;  Nasar  et  al.  2000)  suggest  that  large  palindromes 
enhance  recombination  by  inducing  DSBs,  and  a  variety  of  local  sequence  features  ranging  from 
simple  repeats  to  promoters  and  transposons  can  enhance  recombination  at  a  specific  sites.  For 
example,  a  hotspot  1.0- 1.6  kb  in  the  mouse  MHC  was  found  to  be  centered  on  an  MT  element 
(Yauk  et  al.  2003),  and  in  humans  specific  sequences  such  as  microsatellites  (Benet  et  al.  2000) 


and  minisatellites  (Wahls  et  al.  1990;  Boan  et  al.  1998;  Jeffreys  et  al.  1998)  are  strongly 
associated  with  recombination  hotspots.  It  is  unclear  what  features  serve  to  localize  breakpoints 
when  such  recombinogenic  motifs  are  apparently  lacking,  as  in  the  case  of  the  NF] 
microdeletion. 

Breakpoint  Clustering  and  Alignment  Mismatch 

The  NFIREPs  and  the  CMT1 A  REPs  differ  markedly  in  the  degree  and  nature  of  sequence 
differences  between  paralogs,  and  this  may  relate  to  different  causes  of  breakpoint  localization  in 
the  two  systems.  The  NF1  REP-51  repeats  show  97.5%  sequence  identity,  lower  than  for  the 
CMT1A  REPs  at  98.3%.  The  proportions  of  PSVs  are  approximately  the  same  (1.4%  and  1.2%, 
respectively).  However,  the  NF1  REP-51  repeats  have  nearly  three  times  the  proportion  of  bases 
in  alignment  gaps  in  the  CMT1A  REPs  (1.1%  and  0.4%,  respectively).  Despite  the  greater 
proportion  of  gaps  in  the  NFIREPs,  within  them  the  PRS  are  regions  of  relatively  good 
alignment,  with  indels  of  only  1-4  bp  (Figs.  3  and  4).  Larger  gaps  of  approximately  10  to  40  bp 
are  contiguous  to  but  not  contained  within  the  two  PRS.  The  lack  of  such  large  alignment  gaps 
in  the  CMT1 A  REPs  means  that  study  of  these  REPs  doesn’t  so  directly  address  the  effect  of 
gaps  on  breakpoint  clustering.  The  simplest  explanation  for  the  location  of  the  NF1  PRS  clusters 
is  that  gaps  of  1-4  bp  do  not  prevent  effective  pairing  and  subsequent  genetic  exchange,  but  that 
larger  gaps  (here  approximately  10  to  40  bp)  do  not  allow  effective  pairing  and  exchange  in  their 
immediate  vicinity. 
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Chromatin  Accessibility 

If  local  motifs  are  not  obligatory,  another  reasonable  explanation  of  breakpoint  clustering  is 
accessibility  of  DNA  to  proteins  of  the  recombination  machinery.  In  yeast  there  is  abundant 
evidence  that  crossovers  occur  at  sites  of  accessible  chromatin  (promoters,  transcription  factor 
binding  sites)  (Baudat  and  Nicolas  1997;  de  Massy  2003).  This  may  be  true  in  mammals  but  the 
evidence  to  date  is  less  strong.  For  NF1  microdeletion,  the  PRS1  and  PRS2  hotspots  appear  to 
be  in  non-transcribed  DNA  in  both  participating  paralogs,  NF1REP-P1  and  NF1REP-M.  The 
only  NF1  REP-51  breakpoint  in  a  coding  region  is  that  of  case  UWA 160-1  (Figure  2),  which 
occurs  in  intron  7  of  KlAA0563rel.  However,  chromatin  accessibility  due  to  promoters  even 
substantially  upstream  of  an  exchange  region  may  play  a  role  enhancing  recombination,  and  the 
mere  presence  of  promoter-like  sequences  even  unattached  to  a  gene  (apparently  the  case  for 
NF1REP  PRS2)  may  make  the  site  more  accessible  to  recombination  proteins.  About  half  of 
reported  paralogous  recombination  disorders  involve  recombination  between  either  related 
expressed  genes,  or  between  an  expressed  gene  and  its  pseudogene  (Stankiewicz  and  Lupski 
2002a),  and  the  relative  accessibility  of  the  chromatin  may  persist  even  as  the  transcription  unit 
becomes  non-functional. 

Another  chromatin  feature  thought  to  bear  on  recombination  is  the  matrix  attachment 
region  (MAR).  Strong  topoisomerase  II  sites  have  been  shown  in  translocation  studies  to  co¬ 
localize  with  experimentally  detected  DNase  I  sensitive  sites,  and  several  have  been  mapped  at 
inter-chromosomal  translocation  breakpoints  (Zhang  et  al.  2002).  None  of  the  NF1 
microdeletion  breakpoints  occurred  at  the  MAR  site  in  the  NF1  REP-51  repeat  (Figure  3), 
arguing  against  matrix/scaffold  attachment  as  a  primary  breakpoint-localizing  feature  in  this 
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disorder. 


Gene  Conversion  Between  Paralogs 

The  finding  of  significantly  long  identical  sequences  and  shared  polymorphisms  suggest  that 
gene  conversion  has  occurred  in  the  NF1  PRS2  hotspot.  In  addition,  above  we  cited  three  cases 
where  sufficient  sequence  detail  from  the  parental  chromosomes  demonstrated  gene  conversion 
in  the  process  of  strand  exchange  (Lopez-Correa  et  al.  2001;  Dorschner  et  al.  submitted).  The 
same  evidence  for  gene  conversion  accompanying  crossover  was  detailed  for  the  primary 
CMT1A/HNPP  recombination  hotspot  (Reiter  et  al.  1998;  Lopes  et  al.  1999).  If  gene  conversion 
accompanies  crossover  events  ascertained  by  pathology,  it  is  also  possible — even  likely — that 
gene  conversion  between  the  same  paralogs  may  occur  in  the  absence  of  crossover.  Such  events 
would  not  be  pathological  but  should  be  detectable  by  sequencing  large  numbers  of  normal 
alleles.  The  two  diseases  cited  here  are  promising  early  evidence  that  recombination  hotspots 
coincide  with  and  are  mechanistically  related  to  gene  conversion  hotspots, 

It  is  suggested  that  gene  conversion  acts  to  homogenize  paralogous  sequences,  and  that 
this  in  turn  makes  them  more  likely  to  pair  (Hurles  2001;  Stankiewicz  and  Lupski  2002b)  but  we 
cannot  say  with  certainty  which  is  cause  and  which  is  effect.  That  is,  either:  1)  a  hotspot  region 
is  prone  to  effective  pairing  for  reasons  external  to  the  tract,  and  gene  conversion  events  within 
the  tract  serve  to  homogenize  the  paralogs  at  this  site;  or  2)  a  hotspot  region  preferentially  pairs 
because  it  happens  to  be  a  larger  perfect  tract  than  the  neighboring  sequence,  and  the  resulting 
gene  conversion  helps  to  maintain  this  high  sequence  identity. 

Some  care  is  warranted  in  using  single  copies  of  paralogous  sequences  to  test  for 
paralogous  gene  conversion,  because  the  size  of  perfect  fragments  depends  strongly  on 
population  variation  for  the  defining  PSVs.  Among  the  five  copies  of  the  NFIREPs  available  in 
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Genbank  (two  of  NF1REP-P1  and  two  of  NF1REP-M;  see  Methods),  a  fragment  at  the  site  of  the 
700  bp  matching  fragment  reported  above  varies  in  size  but  is  always  the  largest  (and  a 
statistically  significant;  all  p< 0.05)  matching  fragment.  We  also  applied  the  same  test  we  used 
on  the  NFIREPs  (Sawyer’s  GeneConv;  see  Methods)  to  the  original  Genbank  copies  of  the 
CMT1 A  REPs  (Reiter  et  al.  1996);  accession  numbers  in  Methods)  and  found  a  950-bp 
significant  fragment  (p  =  0.0004;  indels  included  as  variants).  This  tract  coincides  with  the 
secondary  (20%)  CMT1 A  recombination  hotspot  (nt  24879-25830  in  Genbank  #U71218).  Using 
alternate  CMT1 A  REP  sequences  (accession  numbers  in  Methods)  shortens  the  significant  950 
bp  fragment  to  658  bp  but  even  at  this  length  it  is  still  significant  (p- 0.020).  Hurles  et  al.  (2001) 
reported  statistical  evidence  of  gene  conversion  in  these  sequences,  but  they  did  not  identify 
particular  significant  fragments. 

Analysis  of  normal  sequences  for  conversion  tracts  detects  where  paralogous  pairing 
apparently  has  occurred  but  crossover  did  not.  Deeper  allele  sampling  of  these  sites  in  normal 
genomes  would  greatly  help  characterize  these  events.  If  bom  out  by  subsequent  studies,  the 
coincidence  of  the  hotspots  for  both  crossover  and  gene  conversion  would  strongly  support  a 
dual  pathway  recombination  model  in  humans  similar  to  that  proposed  for  yeast,  where  the  initial 
steps  include  a  DSB  and  pairing,  but  the  outcome  may  be  either  gene  conversion  without 
crossover,  or  recombination  (with  or  without  gene  conversion;  reviewed  by  de  Massy  2003). 

Pairings  between  paralogs  are  “mistakes,”  where  the  paired  sequences  may  differ 
considerably  more  than  typical  alleles  (homologs),  but  they  still  match  sufficiently  that 
subsequent  recombination  steps  apparently  proceed  normally.  The  main  contribution  of 
paralogous  pairing  to  our  knowledge  of  recombination  in  general  may  be  in  those  examples 
where  the  paralogs  are  especially  small  or  (as  in  the  NFIREPs)  the  paralogous  alignment  match 
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displays  substantial  gaps.  Accumulation  of  such  examples  will  serve  to  illustrate  the  minimal 
length  and  quality  of  sequence  pairing  needed  to  effect  recombination  in  humans. 

METHODS 

Patients 

NF1  patients  carrying  microdeletions  have  been  described  previously  (Dorschner  et  al.  2000; 
Lopez-Correa  et  al.  2001);  (Dorschner  et  al.  submitted)  and  references  therein.  Patient  NF160-1 
is  described  in  Leppig  et  al.  ( 1997). 

Identification  of  a  Novel  Breakpoint 

The  breakpoints  of  UW  A 160-1  were  mapped  previously  to  NF1REP-P1  and  NF1REP-M  by  STS 
analyses  of  somatic  cell  hybrid  lines  carrying  only  the  deleted  chromosome  17  (Lopez-Correa  et 
al.  2001;  Dorschner  et  al.  submitted).  Here  we  mapped  the  breakpoint  to  a  region  about  10  kb 
centromeric  of  PRS1  by  comparing  sequences  of  the  deleted  chromosome  17  to  paralog'ous 
sequences  from  BAC  clones  and  normal  chromosome  17s,  as  described  previously  (Dorschner  et 
al.  2000;  Dorschner  et  al.  submitted).  An  approximately  5  kb  amplicon  containing  the 
breakpoint  was  amplified  from  genomic  DNA  of  the  hybrid  line  carrying  the  deleted 
chromosome  with  primers  160P1-F  (5’-GGCTCATGTGTAATGATCTTTAACGCG-3’)  and 
MD1-R(v2)  (5  ’-CAAAGATTATCACTGATGGAGTTGG-3  ’).  This  amplicon  was  sequenced 
with  several  internal  primers  to  identify  the  breakpoint  interval.  Note  that  the  above  primers  are 
not  necessarily  NFIREP-specific  and  may  amplify  non-recombinant  NF1REP  from  normal 
individuals. 
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Defining  REP-Speeific  Nucleotide  Site  Variants 

Paralogous  sequence  variants,  or  “PS Vs”  were  initially  defined  by  differences  between  the 
reference  BACS  (see  below)  for  each  paralog.  The  pattern  of  PS  Vs  from  amplicons  of  the 
deleted  chromosomes  17  were  expected  to  match  that  of  either  NF1REP-P1  or  NF1REP-M,  or 
transition  from  NF1REP-P1  to  NF1REP-M  if  the  amplicon  contained  the  breakpoint  of  the 
recombinant  NF1  REP.  This  strategy  narrowed  the  recombination  interval,  which  was 
subsequently  identified  by  repeating  the  process  with  closely  spaced  primers  from  the  interval  of 
interest. 

To  help  differentiate  PSVs  from  common  polymorphisms  at  allelic  sites  (SNPs),  we  also 
amplified,  cloned,  and  partially  sequenced  nonrecombinant  PRS1  and  PRS2  regions  from  both 
NF1REP-P1  and  NF1REP-M  of  six  normal  chromosomes  and  of  the  NF1REP-M  carried  in  BAC 
951F1 1,  using  the  Expand  LT  PCR  system  (Roche  Molecular  Biochemicals,  Mannheim, 
Germany).  Amplicons  were  column  purified  and  cloned  into  the  TOPO  XL  PCR  cloning  kit 
(Invitrogen  Corporation,  Carlsbad,  CA).  Plasmid  preparations  of  clones  were  purified  and 
sequenced  directly  using  internal  primers  as  described  previously  (Dorschner  et  al.  2000;  Lopez- 
Correa  et  al.  2001;  Dorschner  et  al.  submitted). 

Genomic  Contigs  and  Reference  Sequences 

The  region  spanning  between  and  including  NF1REP-P1  and  NF1REP-M  is  finished,  assembled 
sequence  in  NCBI  build  32  (April  2003).  Finished  BACs  for  the  two  regions  (RP1 1-271K1 1  and 
CTD-2349P21  for  NF1REP-P1  and  RP1 1-640N20  for  NF1REP-M)  were  used  as  primary 
reference  sequences.  Two  additional  BACs  closely  matching  NF1REP-P1  are  in  draft  state 
(RP13-715K7;  RP1 1-1403G4)  and  one  additional  BAC  matching  NF1REP-M  is  finished  (CTD- 
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2646 J6).  Finished  chromosome  19  BACs  representing  NF1REP-E19  are  indicated  in  Fig.  1C. 
For  the  proximal  and  distal  CMT1 A  REPs  our  primary  references  were  Genbank  U71217  and 
U71218,  respectively.  Additional,  finished  copies  of  the  CMT1 A  REPs  were  taken  from 
AC005838  (proximal)  and  assembled  using  AC005389  and  AC005838  (distal). 

Sequence  Composition  and  Motif  Searches 

We  searched  for  short  sequence  motifs  using  the  EMBOSS  program  “Fuzznuc.”  Putative 

recombination  motifs  were  chosen  from  lists  in  Lopes  et  al.  (1998),  Badge  et  al.  (2000)  and 

Lopez-Correa  et  al.  (2001):  E.  coli  Chi  sequence  (5’-GCTGGTGG-3’), 

yeast  Ade6-M26  heptamer  (5’-ATGACGT-3’),  XY32  homopurine/homopyrimidine 

(5  ’-AAGGGAGAARGGGTAAAGGGRAAGAGGGAA-3  ’), 

retroposon  LTR  (5  ’-TCATACACCACGCAGGGGTAGAGGACT-3  ’), 

the  long  terminal  repeat  element  LTR-IS  (5’-TGGAAATCCCC-3’), 

protein  binding  site  pur  (5’-GGNNGAGGGAGARRRR-3’), 

translin  consensus  1  (5’-GCNCWSSWN(0-2)GCCCWSSW-3’), 

translin  consensus  2  (5  ’-MTGCAGN(0^)GCCCWSSW-3  ’), 

Saccharomyces  cerevisiae  ARS  (5’-WTTTATRTTTW-3’), 

Schizosaccharomyces pombe  ARS  (5’-WRTTTATTTAW-3’), 

human  replication  origin  consensus  (5’-WAWTTDDWWWDHWGWHMA-WTT-3’),  and 
human  minisatellite  core  sequences  (5’-GGGCAGGAG-3’;  5  ’  -GG AGGTG-GGC  AGG ARG-3  ’ 
5’-AGAGGTGGGCAGGTGG-3’).  Minisatellite  core  sequences  were  searched  using  the  9  bp 
core  sequence  allowing  one  mismatch,  and  the  16  bp  core  sequences  allowing  up  to  4 
mismatches. 


Known  genes  and  annotated  features  were  assessed  with  the  NCBI  (Genbank)  and  UCSC 
Human  Genome  Working  Draft  databases  and  browsers.  We  searched  the  NF1REP  for  internal 
repeats  by  pairwise  BLAST  (NCBI),  Tandem  Repeats  Finder  (Benson  1999),  PattemHunter  - 
(Ma  et  al.  2002),  and  the  EMBOSS  program  “Palindrome.”  We  tested  for  matrix  attachment 
regions  (MARs)  using  MarWiz  (Singh  et  al.  1997).  High-copy  repeats,  simple  repeats  and  other 
simple  DNA  were  detected  with  RepeatMasker  (see  Web  Site  References),  and  graphical  results 
for  repeats  and  GC  percent  were  obtained  with  Gestalt  Workbench  (Glusman  and  Lancet  2000). 
Candidate  promoters  and  first  exons  were  detected  with  First  Exon  Finder  (“FirstEF”;  Davuluri 
et  al.  2001). 

Polymorphic  Sites  and  Gene  Conversion  Tests 

Sliding  window  sequence  polymorphism  and  divergence  plots  were  created  with  DnaSP  (Rozas 
and  Rozas  1999),  which  tabulates  differences  within  and  between  populations,  reports  shared 
variants,  and  outputs  alignments  listing  polymorphic  sites  only  (with  alignment  gaps  included  or 
excluded).  The  gene  conversion  test  of  Sawyer  (GeneConv;  Sawyer  1989)  tabulates  all 
differences  between  sequences,  tests  the  distribution  of  perfect  fragment  sizes  against  random 
simulations  of  the  dataset,  and  reports  p-values  for  the  probability  of  obtaining  a  particular 
perfect  match  by  chance.  This  can  be  done  with  or  without  inclusion  of  indels  as  polymorphic 
sites.  When  included,  contiguous  indel  sites  are  treated  as  single  variants.  Significance  values 
are  Bonferroni  corrected  for  the  number  of  tests,  and  results  are  listed  by  decreasing  fragment 
size  and  p-value.  A  program  that  performs  these  tests  is  available  (see  Web  Site  References). 
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FIGURE  LEGENDS 


Figure  1.  Arrangement  and  structure  of  the  NF1REP  paralogs  flanking  the  NF1  gene.  (A)  The 
350  kb  NF1  gene  (transcribed  left  to  right)  is  flanked  by  the  paralogs  NF1REP-P1,  -P2,  and  -M 
(boxes).  NF1REP-P1  and  NF1REP-M  are  in  direct  orientation  (arrows).  The  extent  of  the 
recurrent  1.5  Mb,  germline  microdeletion  is  shown  above  die  map  (Dorschner  et  al.  2000). 

Genes  labeled  with  vertical  bars  are  presumed,  functional  genes  from  the  RefSeq  database,  with 
the  exception  of  JJAZ1-W,  a  pseudogene  of  JJAZ1  (Jenne  et  al.  2003).  The  BAC  identifiers  and 
accession  numbers  refer  to  the  first  finished  Genbank  sequences  for  that  feature.  Drawing  is  to 
scale.  (B)  The  three  paralogs  in  the  NF1  region  are  shown  oriented  from  centromere  to  telomere. 
BAC  identities  and  accession  numbers  are  shown.  Green  color  indicates  sequences  of  the 
KIAA0563rel  functional  gene  and  related  pseudogene  fragments  (W),  with  numbered  black 
vertical  bars  designating  exons  or  related,  exon-derived  sequences.  Light  green  boxes  within 
KIAA0563rel  copies  indicate  the  orientation  of  a  5.8  kb  inverted  repeat  with  two  copies  on 
NF1REP-P1,  and  a  third  copy  in  NF1REP-M.  Landmark  STSs  in  the  KlAA0563rel  gene  and 
pseudogenes  cited  previously  (Dorschner  et  al.  2000)  are  indicated  above  NF1REP-P1.  Yellow 
boxes  depict  SMURF2- derived  pseudogene  fragments,  and  red  boxes  denote  the  PRS.  (C) 
Comparison  ofNFIREP-Pl  to  NF1REP-E19  located  at  chromosome  19pl3.13.  Colored  boxes 
match  those  described  for  panel  B.  PRS1  and  PRS2  sites  in  NF1REP-E19  are  not  known  to 
serve  as  recombination  substrates,  NF1REP-E19  contains  the  LEC2  functional  gene  and 
NF1REP-P1  contains  pseudogene  fragments  of  LEC2  (purple). 


Figure  2.  Paralogous  recombination  sites  and  sequence  composition  of  the  NF1  REP-5 1 
paralogs.  Nucleotide  coordinates  and  sequence  features  refer  to  the  NF1REP-P1  reference  BAC 
RP1 1-271K11.  Breakpoint  intervals  are  shown  as  red  bars,  with  the  number  of  NF1 
microdeletion  cases  mapped  to  each  interval  indicated  (Dorschner  et  al.  2000;  Lopez-Correa  et 
al.  2001;  Dorschner  et  al.  submitted).  Dotted  arrows  indicate  the  PRS1  and  PRS2  recombination 
hotspots.  The  black  bar  below  PRS2  indicates  the  700  bp  putative  gene  conversion  tract  between 
NF1REP-51  repeats  (see  text  and  Fig.  4).  Patient  UWA160-1  is  a  new  microdeletion  case  that 
maps  within  NF1  REP-51  but  outside  the  PRS  clusters  (see  text).  High-copy  repeats  in  NF1  REP¬ 
P-51  (forward  strand  on  top,  reverse  strand  on  bottom)  and  percent  G+C  are  displayed  using 
graphical  output  from  the  Gestalt  Workbench  (Glusman  and  Lancet  2000).  Bar  size  indicates  the 
relative  repeat  age,  and  colors  indicate  repeat  class  (LINE  elements  in  green,  Alus  in  red,  MIRs 
in  purple,  tRNAs  in  blue  (none  found  here)  and  all  others  in  brown).  Overall  GC  content  is 
48.4%,  and  the  blue-to-red  transition  indicates  an  isochore  transition  from  HI  -2  (43-50  percent 
GC)  to  H3  (>50  percent  GC). 

Figure  3.  Alignment,  sequence  identity,  and  sequence  features  of  the  51  kb  high-identity 
paralogs  NF1REP-P1-51  and  NF1REP-M-51.  Nucleotide  coordinates  refer  to  the  NF1REP-P1 
reference  BAC  RP1 1-271K1 1.  The  top  panel  shows  the  size  of  alignment  gaps  using  reference 
BACs  271K1 1  (NF1REP-P1)  and  640N20  (NF1REP-M).  The  middle  panel  shows  nucleotide 
site  differences  between  REPs  (paralogous  sequence  variants,  or  PS  Vs),  indels  excluded, 
tabulated  with  a  sliding  100  bp  window  stepped  by  25  bp.  Noted  are  two  sites  of  greater 
sequence  difference,  including  a  single,  high-scoring  Matrix  Attachment  Region  (MAR)  and  an 
apparent  gene  conversion  tract  with  variants  matching  NF1REP-E19  (see  Results).  The  location 


of  the  700-bp  fragment  of  perfect  match  with  statistical  evidence  of  gene  conversion  is  indicated. 
Breakpoint  intervals  (red  blocks)  repeat  those  shown  in  Fig.  2.  Blue  arrows  in  bottom  panel  are 
promoter-like  sequences  detected  by  FirstEF  (see  Methods)  but  these  are  not  associated  with 
known  genes. 

Figure  4.  Detailed  structure  of  the  PRS2  region.  Top  panel  with  alignment  gaps  match  those  for 
this  region  shown  in  Fig.  3.  Ninety-three  percent  of  all  PRS2  exchanges  occurred  in  the 
indicated  2.3  kb  hotspot  interval.  The  chart  shows  indels  based  on  the  two  finished  reference 
BACS  (271K1 1  and  640N20),  but  variants  used  for  interval  mapping  (PSVs  and  indels)  were 
based  on  fixed  polymorphisms  in  seven  NF1REP-P1  and  seven  NF1REP-M  clones  for  PRS1  and 
PRS2  from  normal  individuals  (Dorschner  et  al.  2000;  Dorschner  et  al.  submitted).  Hence,  the 
intervals  here  contain  indels  and  site  variants  that  were  not  used  for  breakpoint  mapping.  The 
shared  variants  are  sites  found  polymorphic  in  both  paralogs,  based  on  the  3  proximal  and  2 
medial  Genbank  sequences  (see  Methods).  Simple  sequence  sites  were  found  by  RepeatMasker, 
and  high-scoring  promoter  sequences  were  found  with  FirstEF. 
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Microdeletions  involving  the  entire  NF7  gene  cause  ~5%  of  neurofibromatosis  1  (NF1)  cases 
and  predispose  to  an  early  age  of  onset  and  large  numbers  of  dermal  neurofibromas.  The 
majority  are  recurrent  1.5  Mb  deletions  mediated  by  homologous  recombination  between 
paralogs,  termed  NFlREPs,  that  flank  the  NF1  gene.  Towards  elucidating  the  mechanism(s)  of 
NF1  microdeletion,  we  sequenced  NFIREP-mediated  NF1  microdeletions  and  identified  the 
breakpoint  intervals  by  use  of  paralogous  specific  variants  (PSV).  We  found  that  breakpoints 
clustered  to  two  paralogous  recombination  sites,  termed  PRS1  and  PRS2,  which  are  4,154  and 
6,3 1 5  bp  in  length  and  ~1 5  kb  apart.  Each  PRS  had  a  recombination  hotspot.  AtPRSl  64% 
(9/14)  of  sequenced  PRS1  breakpoints  occurred  in  a  551  bp  hotspot,  while  86%  (25/29)  of  PRS2 
breakpoints  occurred  in  a  2,292  bp  hotspot.  The  breakpoint  frequency  decreased  in  a  generally 
symmetrical  fashion  on  both  sides  of  each  hotspot,  consistent  with  an  initiation  site  for  crossing 
over  in  this  region.  We  developed  PRS1  and  PRS2  deletion  junction-specific  PCR  assays, 
screened  a  cohort  of  NF1  deletion  patients,  and  found  that  69%  (N=78)  of  breakpoints  occurred 
at  the  PRS,  and  of  these  74%  occurred  at  PRS2.  The  PRS  assays  are  highly  specific  reactions 
that  can  be  used  to  define  a  cohort  of  patients  with  the  same  deletion  genotype  for  phenotypic 
studies  and/or  to  provide  clinical  diagnostic  testing. 

Experimental  and  in  silico  genomic  analyses  revealed  that  there  are  four  NFlREPs  in 
the  genome  that  are  complex  modular  assemblies  of  paralogs  from  different  sequence  families. 
NF1REP-P1  and  NF1REP-M,  which  serve  as  recombination  substrates  for  the  recurrent  1.5  Mb 
NF1  microdeletion,  are  133  and  75  kb  in  length  and  share  a  51  kb  region  of  98%  sequence 
identity  in  which  PRS1  and  PRS2  are  located.  Therefore,  despite  51  kb  of  high  sequence 
identity  meiotic  recombination  preferentially  occurs  at  discrete  hotspots  of  <2.2  kb.  NF1  REP- 
PI  is  44  kb  and  located  just  centromeric  to  the  NF1  gene.  It  can  recombine  with  NF1REP-M 
and  result  in  smaller  1  Mb  deletions  involving  NFL  The  fourth  paralog,  NF1REP-E19  is  155  kb 
and  located  at  19pl3.  NF1REP-E19  harbors  both  PRS1  and  PRS2  but  only  fragments  of  the  51 
kb  high  identity  segment  compared  to  NF1REP-P1  and  -M.  It  is  unknown  whether  the  NF1REP 
E19  participates  in  interchromosomal  rearrangements  with  NFlREPs  in  the  NF1  region. 
Together  these  4  NF1REP  elements  are  primarily  comprised  of  low-copy  repeats,  pseudogene 
fragments,  and  two  genes,  KIAA0563-related  gene  and  Lec2 .  Possible  recombinogenic  features 
distinctive  for  each  PRS  separately  include  several  closely  spaced  microsatellites  proximal  to  the 
PRS1  hotspot,  and  a  generally  high  G+C  content  punctuated  by  GA-rich  and  G-rich  islands  for 
the  PRS2  hotspot.  Our  identification  of  PRS  1  and  PRS2  as  preferential  breakpoint  sites  within 
the  surrounding  51  kb  NF1REP  region  of  98%  identity  implies  that  there  is  an  additional 
factor(s),  such  as  a  functional  domain  or  higher  order  chromosomal  feature,  that  influences  the 
site  of  paralogous  recombination  in  the  NF1  region. 
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